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(57) Abstract 



Novel mammalian phospholipase (PLA2) nucleotide sequences and low molecular weight (about 14 KD) amino acid sequences 
encoded thereby are disclosed. More particularly, a cloned human HPLA2 cDNA expressing HPLA2-IO and its cloned rat RPLA2 cDNA 
counterpart, expressing RPLA2-IO, which are characterized as PLA2 Type IV, are disclosed. A second type of PLA2 cDNA, characterized as 
PLA2 Type ni and exemplified by a rat PLA2 cDNA encoding RPLA2-8 and a partial human PLA2 nucleotide sequence encoding HPLA2-8, 
is disclosed. Expression of the cDNAs encode the two new types of PLA2 enzymes which have phospholipase activity. The novel PLA2S 
do not include disulfide bridges between cysteine amino acids 1 1 and 77 or elapid loops. However, tfie novel PLA2S may include amino 
acid COOH-terminal extensions which can vary in length. Seventeen of the eighteen absolutely conserved amino acids in all active 14 
KD PLA2S are believed to be conserved in RPLA2-8 and HPLA2-8, whereas all eighteen are believed to be conserved in RPLA2-IO and 
HPLA^IO. Because the encoded sequences of RPLA2-8 and HPLA2-8 include only 16 cysteine amino acids, they have been designated as 
Type in. RPLA2-IO and HPLA2-IO are designated as Type IV since their encoded sequences include only 12 cysteine amino acids. 
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MAMMALIAN LOW MOLECULAR WEIGHT PHOSPHOLIPASE A2 NUCLEOTIDE AND 
AMINO ACID SEQUENCES 



5 This application is a continuation in part of 

U,S. Serial No. 08/091,941, filed July 15, 1993, entitled 
MAMMALIAN PHOSPHOLIPASE A2 NUCLEOTIDE SEQUENCES AND LOW 
MOLECULAR WEIGHT AMINO ACID SEQUENCES ENCODED THEREBY, 
Field of the Invention 

^0 The present invention relates to novel 

mammalian phospholipase A2 nucleotide sequences, low 
molecular weight (approximately 14KD) amino acid sequences 
encoded thereby, clones and vectors which include the 
mammalian phospholipase Aj nucleotide sequences, antisense 

15 nucleotide sequences complementary to the genes and mRNA 

transcripts encoding for the phospholipase amino acid 
sequences, nucleotide sequences having internal ribosome 
binding sites which allow for internal initiation of mRNA 
cap- independent translation, and cell lines. 

20 Backcnround 

Phospholipase A^s - phosphatide 2-acyl- 
hydrolase, EC 3.1.1.4 (hereinafter "PLA^") constitute a 
diverse family of enzymes that hydrolyze the sn-2 fatty 
acyl ester bond of phosphogylcerides , producing 
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free fatty acid and lysophospholipids . See Dennis, 
E.A. Phospholiphases- In: The Enzymes , edited by 
Boyer, P. New York: Academic Press, p. 307-353 
(1983). Over the past two decades, PL1A2 activities 
have been purified and characterized from different 
tissues, cultured cells, and exudates from different 
mammals. See Rordorf, G. et al.: J. Neuroscience . 
11:1829-1826 (1991)? Seilhamer, J-J. et al,: J. 
Biochem. . 106:38-42 (1989); Langlais J. et al,: 
Biocham. & Biophvs. Res. Comm. . 182:208-214 (1992); 
Murakami, M. et al.: J. Biochem, . 111:175-181 (1992); 
and Jordan, L.M. et al.: J. Chromat. . 597:299-308 
(1992) . These enzymes have been found to vary in 
molecular weight, optimal pH, Ca^"*" dependence, 
substrate specificity, and solubility. 

To date, two classes of unrelated PIiA2S 
have been reported. One is a family of low molecular 
mass, approximately 14}cDa PLA2S which are 
characterized by a rigid three dimensional structure 
maintained by disulfide bridges and a catalytic 
requirement for Ca^^. The other is a high molecular 
mass, 82kDa, intracellular PIiA2 found in the 
cytosolic subcellular fraction in the absence of 
calcium, but associated with cellular membranes at 
calcixim concentrations from 0,1 to lOpM. See Clark, 
J.D. et al.: Cell . 65:1043-1051 (1991) and Sharp, 
J.D. et al.: J. Biol. Chem. . 266:14850-14853 (1991). 



SUBSTITUTE SHEET (RULE 26) 



wo 9S/02328 PCT/US94/07926 

-3- 

In addition, several Ca-^-^-insensitive PLAj activities 
are believed to exist, however, it is also believed 
that as yet none of the genes encoding such 
activities have been cloned. 
5 m terms of structure, low molecular 

weight, e.g., about 14kDa, PLAjS i^ank among the best 
characterized enzymes. Complete primary sequences 
have been determined for more than 50 PIAjS from 
organisms such as snakes, bees and humans. See 
10 Heinrikson, R.L.: Methods in Enzvmoloav. 197:201-214 

(1991); Davidson, F.F. et al.: J ■ Mol . Evolution, 

31:228-238 (1990) ; and Dennis, E.A. Phospholiphases. 
In: The Enzvmes . edited by Boyer, P. New York, 
Academic Press, p. 307-353 (1983). In all active 
15 14kDa PIiA2S, 18 amino acids are believed to be 

conserved. See Heinrikson, R.L. : Methods in 

Kn7:vmoloav . 197:201-214 (1991) and Davidson, F.F. .3^ 
Mol. Evolution . 31:228-238 (1990). Based on selected 
structural determinants, the low molecular weight 
2 0 PIA2S have been classified into two types. See 

Heinrikson, R.L. et al.: Bi,ol . Chem. , 

252:4913-4921 (1977). Type I enzymes have a 
disulfide bridge connecting cysteines between amino 
acids 11 and 77. In addition, there is an insertion 
2 5 of three amino acids between residues 54 and 56, the 

so-called elapid loop. The only identified mammalian 
Type I PLA2^' ®®® Seilhamer, J.J. et al . : DHA, 
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5:519-527 (1986) and Ohara, O. et al.: J, Biochem. , 
99:733-739 (1986), are expressed mainly in the 
pancreas and function extracellularly in digestion. 
Type II on the other hand, lack the disulfide 

5 bridge between amino acids 11 and 77, have 

carboxy-terminal (COOH-terminal) amino acid 

extensions which can vary in length, but are commonly 
seven amino acids in length, which typically 
terminate in a half -cysteine joined to Cys-50 near 
10 the catalytic site His-48. The mammalian Type II 

PLA2S reported to date occur in trace amounts in 
several tissues such as liver and spleen and are 
secreted from various cells in response to 
appropriate stimuli. See Seilhamer, J.J. et al. : is. 
15 Biol, Chem. , 264:5335-5338 (1989); Kramer, R,M. et 

al.: J. Biol. Chem. , 264:5768-5775 (1989); Komada , M. 
et al,: J. Biochem. , 106:545-547 (1989); Kusunoki , C. 
et al.: Biochimica Et Biophvsica Acta . 1087:95-97 
(1990); Aarsman, A.J. et al. : J, Biol. Chem. . 
20 264:10008-10014 (1989); Ono, T. et al . : J. Biol. 

Chem. . 264:5732-5738 (1988); Horigome, K. et al . : J. 
Biochem. , 101:53-61 (1987); Nakano, T. et al.: Febs. 
Letters , 261:171-174 (1990); and Schalkwijk, C. et 
ai-i Bioche m. & Biophvs. Res. Comm. . 174:268-272 
25 (1991). It is believed that Type II PLA2S are 

associated with the pathologies of several diseases 
involving infection, tissue damage, and inflammation, 
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such as acute pancreatitis, septic shock, peritonitis 
and rheumatoid arthritis. See Vadas, P. et al.: Lab. 
Invest. . 55:391-404 (1986); Pruzanski, W. et al.: 
Advances in Exper. Med. & Biol. . 279:239-251 (1990); 
Uhl, W. et al.: J. Trauma . 30:1283-1290 (1990); and 
Malf ertheiner, P. et al. : Klinische Wochenscrift . 
67:183-185 (1989). Mammalian Type I and II PLA2® 
share approximately 3 0-40% amino acid homology; 
however, eighteen amino acids are invariantly 
conserved in all known functional PLA2S- TyP® I 
mammalian PIiA2 genes contain 4 coding exons; Type II 
mammalian genes contain five exons, the first of 
which is noncoding. 

In 1990, a distinct 120 bp putative PLA2 
exon-like fragment (hlOa) , homologous to the 
amino-terminus encoding region of known PLA2S, was 
obtained by screening a human genomic DNA library 
with a 4 5 bp human PLA2 Type II oligonucleotide 
probe. See Johnson, L.K. et al . : Advances in Exoer . 
Med. & Biol.. 275:17-34 (1990). Zoo blots indicated 
that the putative exon has been highly conserved 
during evolution. However, additional exons 

indicating the presence of a complete gene, a 
corresponding cDNA, or evidence of transcription in 
different human tissues was not found. 

Neuronal ceroid lipf uscinoses (NCL) , or 
Batten disease, are terminal, inheritable, lysosomal 
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storage diseases of children* They are characterized 
by the acctamulation of an autof Increscent pigment 
(ceroid or lipofuscin) in cells, especially neurons 
and epithelial pigment cells of the retina. NCL 
5 patients typically manifest high levels of the highly 

reactive compound, 4-hydroxynonenal • These levels 
are believed to be a consequence of a failure to 
resolve peroxidized, fatty acids in the normal way. 
It is believed that this failure could be the result 

10 of a phospholipase A2 defect. 

The infantile form of NCL has now been 
linked to chromosome lp33-35. See Jarvela, I. et 
al.: Genomics , 9:170-173 (1991). The non-pancreatic 
PLA2 (*ryp® also been mapped to chromosome 1. 

15 The Type II gene and two additional putative 

exon-like "fragments" (h8 and hlOa) , see Johnson, 
L.K. et al.: Advances in Exper. Med. & Biol. . 
275:17-34 (1990), are located at about lp34 - in the 
middle of the region where gene for infantile NCL is 

2 0 believed to reside. See Jarvala, I, et al.: 

Genomics . 9:170-173 (1991). h8 and hlOa each contain 
a unique sequence which is highly homologous to, but 
distinct from, exon two (which contains the calcium 
binding domain) of PLA2 Type II. 

25 Consequently, there is a continuing need to 

further identify and characterize additional PLA2 
exons if such exist- Such exons could be part of 
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unidentif ied genes. To the extent there are 
additional unidentified PI1A2 exons and genes, they 
may encode proteins (enzymes) that function in a 
manner different from, similar to, or overlapping 
5 with, the known PIiA2S. Moreover, such unidentified 

exons and/ or genes and the enzymes encoded thereby 
may be regulated by some of the same effectors as the 
known PIiA2 genes and their proteins- Investigation 
of these unidentified exons and/or genes and their 

10 encoded enzymes may therefore result in new 

approaches to therapy of PLA2"^®1^'^®*^ diseases, such 
as Batten disease and inflammatory disease. 
Alternatively, these unidentified enzymes may have 
entirely different physiologic and pathologic 

15 functions. Thus, therapeutic approaches designed to 

block the activity of the known Type II PIiA2 enzymes 
may also block or reduce the activity of these novel 
enzymes, thereby producing unexpected side effects. 
Still further, a better understanding of the 

2 0 regulation of expression of the known and 

unidentified Type II PLA2 genes in different tissues 
will likely expand the overall understanding of the 
biology and metabolic processes involving PIiA2S. 
gnirnna-r y of -the Invention 

2 5 In brief, the present invention overcomes 

certain of the above-mentioned shortcomings and 
drawbacks associated with the present state of the 
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PLA2 through the discovery of a novel family of 

mammalian PIiA2 genes or nucleic acid sequences 
encoding low molecular weight amino acid sequences, 
clones, vectors, antisense nucleotide sequences, 
nucleotide sequences having internal binding sites, 
and cell lines. 

More particularly, the low molecular 
weight, i.e., about 14kDa, amino acid sequences 
encoded by the novel family of mammalian PLA2 genes 
or sequences of the present invention may be 
generally characterized as enzymes having esterase 
activity specific for the acyl group at the sn2 
position of glycero-phospholipids. Moreover, the 
novel amino acid sequences of the present invention 
do not include disulfide bridges between cysteine 
amino acids 11 and 77 and elapid loops. Still 
further, the novel amino acid sequences of the 
present invention may in some instances include 
COOH-terminal amino acid extensions which can vary in 
length. In addition, because of the difference in 
the nuitiber of cysteine residues in the encoded amino 
acid sequences, those novel of the present 

invention that include 16 cysteine amino acid 
residues have been designated as Type III whereas 
those novel Type IV PLA2S of the instant invention 
include 12 cysteines and have been designated at Type 
IV. Exemplary of Type III PIA2S of the present 
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invention are the genes identified as RPLA2-8 (rat) 
and partial HPI1A2-8 (human) , as well as the RPLA2-8 
(rat) cDNA. Examples of Type IV PI1A2S of the present 
invention are the cDNAs identified as RPLA2-IO (rat) 
5 and HPIiA2-10 (human) . 

In accordance with the present invention, a 
human PLA2 -encoding cDNA, which expresses HPIA2-IO, 
see FIG. 12, has been isolated from human brain RNA 
by RACE-PCR technique. The HPLA2-10 cDNA also has 
10 been isolated from a human stomach cDNA library. In 

addition, two rat PIiA2 encoding cDNAs, designated 
RPIA2-8 (FIG. 3) and RPLAj-lO (FIG. 11) , have been 
isolated from rat brain and heart cDNA libraries, 
respectively. The RPLA2-IO is believed to be the 
15 counterpart of the HPLA2"^0- RPLA2-IO and HPIA2"^^ 

share about 79% and 78% homology at the open reading 
frame nucleic acid and amino acid sequence levels, 
respectively. The mature enzyme encoded by the 
HPLA2"^0' clone has a calculated molecular weight of 
20 about 13,592, whereas the mature enzyme encoded by 

the RPLA2-8 clone has a calculated molecular weight 
of about 14,673. As indicated, a partial human 
genomic counterpart to RPI1A2-8, HPI1A2-8 genomic DNA, 
has been isolated. See FIG. 19. 

Comparison of the RPIiA2~8 amino acid 
sequence deduced from the cDNA sequence to Type I and 
Type II PLA2S is shown in FIGS. 8 and 9. The signi- 



25 
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f leant structural features of the RPLA2-8 protein are 
summarized in TABLE I. Seventeen (17) of the 
eighteen (18) absolutely conserved amino acids in all 
active 141cDa PIiA2S are conserved in RPI1A2-8. RPtA2''8 
protein does not contain either a disulfide bridge 
between Cysteines 11 and 77 or an elapid loop, which 
are both characteristic of Type I PLA2S. RPLA2-8 
protein, however, does include a seven amino acid 
COOH-tenninal extension having the sequence GRDKLHC, 
as shown in FIG. 27, which is a characteristic of 
Type II PIiA2S as evidenced in FIGS. 2 2 and 27. 
Furthermore, unlike mammalian type I and II PIjA2S 
which have 14 cysteine amino acid residues, RPLA2-8 
protein includes 16 cysteine amino acid residues. It 
is therefore believed that RPI1A2-8 encodes a novel 
.PLiA2, which has been designated as PIA2 Type III. 

The cDNAs of RPIA2-10 and HPLA2-IO are 
l.Skb (FIG. 11) and l.lkb (FIG. 12), respectively. A 
comparison between the deduced amino acid sequences 
from RPLA2-IO and HPLA2-IO is shown in FIG. 13. 
FIGS. 14 and 15 are comparisons between the HPI1A2-IO 
deduced amino acid sequence and those of Type I and 
II human PLA2S/ respectively. FIGS. 18 and 16 are 
comparisons between the RPLA2-IO deduced amino acid 
sequence and those of Type I and II rat PIiA2S, 
respectively. A comparison between the deduced amino 
acid sequences from RPLA2-10 and RPLA2"S shown in 
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FIG. 17, Th major structural features of human and 
rat PLA2"^^ deduc d amino acid sequences are listed 
in TABLE I. All eighteen (18) conserved amino acids 
in all of the active low-molecular weight, 
5 approximately 14kDa, PIiA2S are conserved in both 

human and rat PIiA2-10 amino acid sequences of the 
present invention. Like the predicted RPLA2-8 amino 
acid sequence, human and. rat PLA2-IO amino acid 
sequences also lack disulfide bridges between Cys-11 

10 and 77 and elapid loops. However, PLA2-IO amino acid 

sequences are believed to differ from RPLA2-8 protein 
by having twelve (12) cysteine residues instead of 
sixteen (16). They further differ from RPLA2-8 in 
that RPLA2-10 does not have a COOH- terminal amino 

15 acid extension as depicted in FIG. 27 and HPLA2-IO 

has only a single serine amino acid COOH-terminal 
extension as illustrated in FIG. 22. The PLA2-IO 
proteins of the present invention have therefore been 
designated, as mentioned hereinbefore, as PLA2 Type 

20 IV. 

The present invention also contemplates 
antisense nucleotide sequences which are 
complementary to the genes and mRNA transcripts which 
encode for the Type III and Type IV PLA2S. Exemplary 
25 of antisense sequences in accordance with the present 

invention are those which are complementary to the 
entire or portions of the nucleotide sequences set 
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forth in FIGS- 3, 11, 12 and 19. It: should therefore 
be understood that the present invention contemplates 
any antisense nucleotide sequence which may be useful 
in connection with inhibiting or interfering with th 
5 expression of the Type III and Type IV PLA2 enzyme 

genes and mRNA transcripts therefor. 

The above features and advantages will be 
better understood with reference to the FIGS., 
Detailed Description and Examples which are set out 
10 hereinbelow. It should be understood that the 

biological materials of this invention are exemplary 
only and are not to be regarded as limitations of 
this invention. 

Brief EtescriPtion of the FIGS. 

15 Reference is now made to the accompanying 

FIGS . in which are shown characteristics 
corresponding to the novel mammalian 14KD PLA2S of 
the present invention from which certain of their 
novel features and advantages will be apparent: 

2 0 FIG. 1 depicts a diagram of RPLA2-8 cDNA 

showing positions of open reading frame coding 
region, repeats, and 5' and 3' termini (the first and 
last eight (8) nucleotides are cloning linkers) ; 

FIG. 2 depicts a postulated secondary 

2 5 structure of RPIiA2-8 cDNA showing a stem and a loop 

containing the open reading frame coding region; 
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FIG. 3 depicts the RPLA2"^ cDNA and derived 
amino acid sequence (th first and last eight (8) 
nucleotides are cloning linkers) ; 

FIG. 4 depicts a diagram of the genomic DNA 
5 region containing exons 2, 3 and 4 of RPLA2-8 in 

comparison to the corresponding cDNA; 

FIG. 5 is a comparison between HPI1A2-8 Exon 
I and RPLA2-8 Exon I sequences; 

FIG. 6 is a comparison between HPLA2-8 Exon 
10 II and RPLA2-8 Exon II sequences; 

FIG. 7 is a comparison between RPLA2-8 Exon 
IV and RPIiA2-8 Exon IV sequences; 

FIG. 8 is a comparison of RPLA2-8 deduced 
amino acid sequence and rat PIiA2 Type I amino acid 
15 sequence; 

FIG. 9 is a comparison of the RPIiA2-8 
deduced amino acid sequence and rat PIA2 Type ^11 
amino acid sequence; 

FIG. 10 depicts a flow diagram of 3' and 5' 
2 0 RACE-RT PGR techniques used to obtain a full length 

HPLA2-IO sequence cDNA from brain mRNA; 

FIG. 11 depicts the RPIiA2-10 cDNA sequence 
shoving primary cDNA sequence and various primer 
sequences, which are used in sequencing and 
25 synthesis, are underlined; 

FIG. 12 depicts the HPIiA2-10 cDNA (Type IV) 
sequence and a secondary (clone HPLA2IO-5) cDNA 
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sequence which is slightly different at the 5' end 
and forshortened- Various primer sequences used in 
sequencing and synthesis are underlined. 

FIG. 13 is a comparison between deduced 
5 amino acid sequences of HPIiA2-10 and RPLA2-IO; 

FIG. 14 is a comparison between HPIA2-IO 
deduced amino acid sequence and human Type I amino 
acid sequence; 

FIG. 15 is a comparison between HPLA2"^^ 
10 deduced amino acid sequence and human PLA2 ^YP© H 

amino acid sequence; 

FIG. 16 is a comparison between deduced 
amino acid sequences of RPIiA2-10 and rat PLA2 Type II 
amino acid sequence; 
15 FIG. 17 is a comparison between deduced 

amino acid sequences of RPI1A2-IO and RPLA2-8; 

FIG. 18 is a comparison between deduced 
amino acid sequence of RPIA2-IO and rat PIiA2 Type I 
amino acid sequence; 
2 0 FIG. 19 depicts the partial human genomic 

HPLA2"® sequence. Putative exon 1 and exons 2 

and 4 are underlined; 

FIG. 20 depicts a diagram of the vector to 
express discistronic mRNA. The chloramphenicol 
25 acetyl transferase and luciferase reporter genes are 

indicated by boxes. The intercistronic sequence that 
is replaced by part of RPIiA2-8 is shown; 
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FIG. 21 illustrates PLA2 activity of 
expressed HPIiA2-10 cDNA. pCHlO is HPIiA2-10 cDNA 
cloned into an Epstein Barr virus-based expression 
vector. CpCHlO-lB, CpCHlO-lC, CpCHlO-lD and 

5 CpCH2 0-2G are independent cell lines which express 

plasmid pCHlO. The CpRASF-2B is a cell line which 
expresses plasmid pRASF into which a known human PL1A2 
Type II gene has been cloned. 

FIG. 22 depicts an alignment of amino acid 
10 sequences of human Types I, II and HPLA2-10 PLA2 • 

Asterisks denote eighteen residues that have been 
conseirved among all active PIiA2 sequences. The 
COOH-terminal amino acid extensions have been 
underscored ; 

15 FIG. 23 depicts the effects of pH on PIiA2 

activity of RPIiA2-8 encoded enzyme (Type III) . More 
particularly, FIG. 2 3 depicts the effects of pH on 
PLA2 activity of RPLA2-8 enzyme expressed by cell 
line CpR8-3'. The CpR8-3 ' cell line expresses 

2 0 plasmid pR8-3 ' which includes the coding region for 

the mature RPIiA2-8 protein (bases 806-1200) which is 
preceded by the signal peptide of pRASF (bases 
131-196) . Assay for PIiA2 activity is as indicated 
herein and in Elsbach, P, et al.: Methods in 

25 Enzvmoloov . 197:24-31(1991); 

FIG. 24 depicts the effects of calcium on 
PIiA2 activity of RPLA2-8 encoded enzyme (Type III) . 
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More particularly, FIG. 24 depicts the effects of 
calcium on PIA2 activity of RPI1A2-8 enzyme expressed 
by cell line CpR8-3 ' . The CpR8-3 ' cell line 
expresses plasmid pR8-3' which includes the coding 
region for the mature RPLA2-8 protein (bases 
806-1200) which is preceded by the signal peptide of 
pRASF (bases 131-196) . Assay for PIiA2 activity is as 
indicated herein and in Elsbach, P. et al.: Methods 
in Enzvmoloay . 197:24-31(1991) ; 

FIG. 25 depicts the effects of pH on PLA2 
activity of HPLA2-IO encoded enzyme (Type IV) . More 
particularly, FIG. 25 depicts the effects of pH on 
PLA2 activity of PLA2 Type II enzyme expressed by 
cell line CpRASF-2B and of PIA2 Type IV enzyme 
expressed by cell line CpCHlO-lD. The CpRASF-2B cell 
line expresses plasmid pRASF into which a known human 
PLA2 Type II gene has been cloned. The CpCHlO-lD 
cell line expresses plasmid pCHlO into which the 
HPLA2"10 cDNA has been cloned. Assay for PLA2 
activity is as indicated herein and in Elsbach, P, et 
al.: Methods in Enzvmoloay , 197:24-31 (1991); 

FIG. 2 6 depicts the effects of calcium on 
PLA2 activity of HPLA2"^0 encoded enzyme (Type IV) . 
More particularly, FIG. 2 6 depicts the effects of 
calcium on PIA2 activity of PLA2 Type II enzyme 
expressed by cell line CpRASF-2B and of PLA2 Type IV 
enzyme expressed by cell line CpCHlO-lD. The 



SUBSTITUTE SHEET (RULE 26) 



wo 95/02328 PCT/US94/07926 

-17- 

CpHASF-2B cell line expresses plasmid pRASF int:o 
which a known human PIiA2 Type II gene has been 
cloned. The CpCHlO-lD cell line expresses plasmid 
pCHlO into which the HPIA2"10 cDNA has been cloned. 
5 Assay for PIA2 activity is as indicated herein and in 

Elsbach, P. et al.: Methods in Enzvmoloav . 197:24-31 
(1991) ; and 

FIG. 27 depicts an alignment of amino acid 
sequences of rat Types I, II, RPLA2"S RPIA2"^^ 

10 PLA2S, Asterisks denote eighteen residues that have 

been conserved among all active PLA2 sequences. The 
COOH-terminal amino acid extensions have been 
underscored . 
Detailed Description 

15 By way of illustrating and providing a more 

complete appreciation of the present invention and 
many of the attendant advantages thereof, the 
following detailed description is provided concerning 
the novel mammalian PLA2 nucleotide sequences, the 

20 low molecular weight amino acid sequences encoded 

thereby, clones, vectors, antisense nucleotid 
sequences, nucleotide sequences having internal 
ribosome binding sites, and cell lines. 

In accordance with the present invention, a 

25 4.4 kb cDNA containing the r8 fragment, a rat genomic 

fragment containing sequences homologous to h8 
fragment, is isolated from a rat fetal brain cDNA 
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library. See FIG. 1. This cDNA is about five-times 
larger than any mammalian PI*A2 cDNA Icnown to date. 
Uniquely, the entire coding region is contained on a 
putative 1 kb loop flanked by 121 bp inverted perfect 
5 repeats, leaving about a 3 kb 3 ' "tail." See FIG* 

2. The sequence of the entire cDNA is shown in FIG. 

3. The size of the gene is about 15 kb. See FIG. 

4. A preliminary screen of some rat tissues by 
reverse transcription and PGR (RT-PCR) , using primers 

10 Pla8-1 and Pla8-4 , reveals the pattern of RPLA2*8 

gene expression indicated in Table I. 
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TABLE I 

Characteristics of Type III and IV PI«A2S 

EjLfi* . ££ii* ^ nature* 

Hum Type I MKLLVIiAVLLTVAAA-*- DSGISPR^ AVWQF^ 

Hum Type II MKTLLLAVIMIFGLLQAHG^ NLVNF^ 

Rat Type III MDLLVSSGMKGIAVFLVFIFC^ (WTTSTLS)'^ SFWQF^ 

Hum Type IV MKGLLPLAWFLACSVPAVQG^ GLLDL^° 

Rat Type IV MKRLLTLAWFLACSVPAVPG ^ ^ GLLEL^^ 

Human Type I PLA2 has a 7 residue propeptide, human 
Type II does not. Human and rat Type IV are like Type 
II; Rat Type III might encode a 7 residue propeptide. 

' depicts the NHg-tenni nal amino acids in the amino acid sequences for the 
respective prepeptides, propeptides and mature peptides. 

Vepresents SEQ ID N0:1:: ^represents SEQ ID N0:2:; ^represents SEQ ID N0:3:; 

represents SEQ ID N0:4: ; ^represents SEQ ID N0:5:; ^represents SEQ ID N0:6:; 

represents SEQ ID N0:7:; ^represents SEQ ID N0:8:: f represents SEQ ID N0:9:; 
'"represents SEQ ID NO: 10:; ''represents SEQ ID NO: 11:; '^represents SEQ ID NO: 12:. 

Conserved Characteristics of Pre, Pro and Mature Peptides: 

Bgt Type III Human and Rat TvT>e IV 

Phe5 Ile9 

Met8 Mete 

YGCYCG Ca^"*" binding loop YGCYCG €3^+ binding loop 

His48, Asp49 active site His48, Asp49 active site 

Position of Cys residues Position of Cys residues 

(disregarding the two (disregarding the two 

extra Cys residues) missing Cys residues) 

Unusual Characteristics of Pre, Pro and Mature Peptides: 

Rat Type III Human an d Rat Type IV 

Val9 Leu5 

Two extra Cys residues Two missing Cys residues 

Ala 102, 103 missing Ala 102, 103 missing 
Unusually large variable 
peptide loop 

Other Characteristics of Pre, Pro and Mature Peptides: 

Rat Type IH """tb^ a nd Rat TVne tv 

No elapid loop No elapid loops 

No disulphide bridge No disulphide bridges 

between Cys 11 and 77 between Cys 11 and 77 

Sixteen Cys residues Twelve Cys residues 

Seven COOH-terminal amino Human Type IV-one serine 
acid extension-GRDKLHC COOH-terminal extension 

Rat Type IV-no COOH- 
terminal amino acid 
extension 

**The numbers designating the positions for the amino 
acids in Table I are for th mature peptides. 
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Mor.eover, according to Northern Blot data of several 
tissues, a RPLA2 mRNA is detected in only the testis 
indicating that the RPIA2-8 gene is testis specific, 
as reported in Table II. 



TABI£ II 



Northern blot data 



Type IV (cl 10) h\aman 



brain 

heart 

kidney 

liver 

lung 

pancreas 

placenta 

skeletal muscle 

spleen 

testis 



Type IV (cl 10) rat Type III (cl 8) rat 



brain 

heart 

kidney 

liver 

lung 

skeletal muscle 

spleen 

testis 
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Using parts of RPIiA2-8 as probes, a partial 
hiiman genomic cl ne which is homologous to rat 
genomic clone is identified. See FIG. 19. To date, 
5 all but the third of the four exons in the human 

genomic DNA, see FIGS- 5-7, is identified and 
sequenced. The 3' flanking regions of the human and 
rat genes show very significant homology (about 50 
percent) for about 500 bp. This conservation is 

10 unusual and suggests functional importance. It is 

functionally demonstrated that RPIiA2-8 cDNA contains 
an internal ribosome binding site that enables 
internal translation initiation. 

A comparison of the significant structural 

15 features of the putative protein encoded by RPIA2-8 

cONA sequence and encoded amino acid sequence to 
those of the corresponding pancreatic and 
non-pancreatic PIiA2 enzymes are shown in FIG. 8 and 
9 . Pancreatic PLA2 is known as Type I and the 

20 non-pancreatic PIiA2 is designated as Type II. It is 

believed that PLA2-8 encodes a novel PLA2 which is 
designated as Type III. An enzyme encoded by a gene 
containing the hlOa sequence is designated Type IV 
(see below) . The proximity (within about a million 

25 base pair region in the mouse) of the genes for Types 

III and IV to the PIiA2 Type II gene suggests a common 
evolutionary origin as does their localization to the 
same band on human chromosome 1. Further, Types II, 
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III and IV are likely to be members of a gene family 
and may represent isozymes- However, a homology 
comparison indicates that the protein is 

relatively distant, evolutionarily , from both Type I 
and Type II PIiA2 enzymes, but is believed to be 
probably closer to Type II. 

In accordance with the present invention, 
human cDNA that contains the hlOa fragment and rat 
cDNA that contains the rat counterpart are isolated. 
See FIGS. 11 and 12. The predicted protein sequences 
of HPLA2-10 and RPLA2'^^ comparisons to each 

other and Types I and II are shown in FIGS. 13-17. 
Some of the significant structural features of the 
proteins encoded by these cDNAs are shown in TABLE 
I. Importantly, the 18 amino acids that are believed 
to be requisite for PLA2 function are conserved in 
both predicted proteins. See FIG. 22. This fact, 
plus the high degree of conservation between species, 
suggests that these Type IV proteins play an 
important role in phospholipid metabolism and 
processes such as membrane structuring, inf laiamation 
and intracellular signaling. 

The amino acid sequences of the present 
invention may be produced by, for example, 
recombinant technology, chemical synthesis or any 
other methods available in the art so long as the 
methodology selected does not interfere with their 
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utilities. Likewise, the nucleotide sequences of the 
instant invention may be produced by, for instance, 
PGR technology, chemical synthesis or any other 
methods available in the art so long as the 
5 methodology selected does not interfere with their 

utilities. Moreover, amino acid residues may be 
deleted or added or alternative amino acid residues 
may be substituted for those recited in the amino 
acid sequences of the instant invention so long as 

10 such changes do not defeat the utilities of such 

amino acid sequences. Still further, it should be 
appreciated that the present invention contemplates 
any amino acid sequences which are equivalent to or 
constitute active fragments of the amino acid 

15 sequences for the Type III and Type IV PLA2 enzymes 

of the present invention. Of course, corresponding 
or other changes may be made to the nucleotide 
sequences of the present invention to accomplish the 
objectives of this invention. 

2 0 It should also be appreciated that the 

present invention contemplates a.) any antisense 
nucleotide sequences which are capable of inhibiting 
or interfering with expression of genes and mRNA 
transcripts encoding Type III and Type IV PIiA2 

2 5 enzymes of the present invention, including any amino 

acid sequences that are equivalent thereto or active 
fragments thereof, and b.) any nucleotide sequences 
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having bases 116-720 of FIG. 3 and any equivalent 
fragments thereto or active fragments thereof that 
allow for internal initiation of mRNA cap-independent 
translation. Like other nucleotide sequences of the 
5 present invention, substitutions, deletions and 

additions may be made to the antisense nucleotide 
sequences and the nucleotide sequences having 
internal ribosome binding sites of the present 
invention so long as the objectives of the present 
10 invention are not defeated. 

HPLAj^lO 

In order to clone an cDNA containing the 
putative HPLA2 exon, two primers, HClolO-1 and 
HClolO-la, are generated according to the 12 0 bp 

15 presumptive exon II sequence. See FIG. 12. PGR 

amplification with these primers is used to screen 
human child brain, adult brain, liver, heart, and 
various white cell cDNA libraries. PGR amplification 
products are not obtained. 

2 0 Since zoo blots have indicated that this 

putative exon is evolutionarily conserved, a rat 
genomic cosmid library (Clontech, Inc.) is screened 
using a PCR-generated copy of the HClolO-1 
HClolO-la fragement as a probe. Three unique 

2 5 positive clones are identified. Southern blot 

anaysis of the three EcoRI -digested clones using the 
HClolO-1 - HClolO-la fragment as a probe identifies a 
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common 5kb band. This band is subcloned into 
EcoRI -digested pUC13 and sequenced. A region (rat-10 
putative exon II) in the 5 kb sequence highly 
homologous to hlOa is identified by computer analysis. 

In order to search for the presence of exon 

I, the 53cb human genomic DNA clone containing 
putative exon II is sequenced completely. Computer 
analysis of the sequence identified two highly 
homologous regions. One appears to be exon II. It 
contains a consensus splice acceptor site at its 5' 
end and a consensus splice donor site at its 3' end. 
The other region, located about 1.2 kb 5' of the exon 

II, contains a consensus splice donor site at its 3' 
end and a putative in-frame ATG start codon at its 5' 
end. It is likely to be exon I. Furthermore, when 
these two putative exons are joined together using 
the assumed splice donor and acceptor sites, the 
resulting sequence encodes a signal peptide and 41 
amino acids which have significant homology to the 
amino terminus of known, mature PLA2S. 

After determining the putative exon I 
sequence, HIO-A, a 5' primer located within exon I, 
and HlO-la, a 3' primer located within exon II, see 
FIG. 12, are used for RT-PCR of tbtal human brain and 
lymphoblast RNA. A unicjue 140bp band from both PGR 
reactions is sequenced. The 14 0 bp contains coding 
exons I and II, but not the putative intron I of 
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HPLAj-lO. 5' and 3' RACE-RT PGR techniques, Frohman, 
M.A. et al.: PNAS . 85:8998-9002 (1988); O'Hara, O. et 
al.: PNAS. 86:6883-6887 (1989); and Loh, Y. et al.: 
Science. 243:217-220 (1989), are then used to 
5 generate the full length cDNA sequence from total 

human brain RNA. See FIG. 10. The entire cDNA 
sequence, designated HPLAj-lO, is shown in FIG. 12. 
Exon-intron junction sites are determined by genomic 
DNA analysis. Since the genomic DNA clone containing 
the first 120 bp of HPlAj-lO is not obtained, it has 
not been determined if there are any introns in this 
region of the HPLAj-lO genomic sequence. if no 
additional exons are found, HPLAj-lO will apparently 
have an exon-intron structure typical of known Type 
II PIAjS with a 5' untranslated exon followed by four 
protein coding exons. 

Primers HIO-A (bases 149-170) and HIO-C 
(bases 52 0-548) are used to screen by PGR 
amplification a human stomach cDNA library 
(Clonetech, Inc.). A 399 bp and a 290 bp PGR 
amplification product are obtained only from the 
stomach cDNA library. The two PGR fragments are 
cloned into pUG19 and sequenced. The sequence of the 
399 bp fragment is identical to the HPLAj-lO RACE-RT 
2 5 PGR generated cDNA sequence from bases 148 to 541. 

The 290 bp fragment is identical to the 3 99 bp 
fragment except that it is missing bases 316 to 422 



15 



20 
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which encompass the 5' end of exon III. See FIG- 
11. The same two PCR fragments are also amplified 
from total human brain and lymphocyte RNA using 
primers HIO-A and HIO-C. The 290 bp PCR product is 
5 much less abundant than the 3 99 bp product when 

amplified from human stomach and brain RNA and 
stomach cDNA library. Since the 290 bp product codes 
only for the signal peptide and the first 41 amino 
acids of the mature protein because of an in-frame 

10 stop codon immediately following the 41st amino acid, 

the in vivo significance of this product is unknown 
at this time- 

Using the 399 bp PCR product as a probe, 
6x10^ individual plaques from the human stomach cDNA 

15 library are screened. Four positive clones are 

identified. The clones, designated HPLA2"10-"2, -3, 
-5,. -7, have inserts of 1.4, 2.3 0.9, and 0,8 kb, 
respectively. The inserts of these clones are 
released by EcoRI digestion, subcloned into pUC19 and 

2 0 sequenced completely. HPIiA2-10-2 contains exon 

I-intron I-exon II of HPLA2-IO; HPLA2-IO-3 contains 
intron Ill-exon IV-intron IV of HPLA2"1^- "T^® 
sequences of both HPLA2"1^"^ ^rid HPLA2-IO-7 are 
identical to the corresponding regions of the 

25 RACE-RT-PCR generated HPLA2"^0 sequence except that 

the 5' end of the HPLA2-IO-5 starts at base 142 of 
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the RACE-RT-PCR sequence and the 5' end of HPLA2-IO-7 
starts at base 23. 

To determine the transcription pattern of 
HPLA2-IO, a Human Multiple Northern Blot (Clontech, 
5 Inc.) is probed with a 399 bp fragment, i.e., 

HPIA2-IO PGR probe, generated by PGR with primers 
HIO-A (bases 149-170) and HIO-C (bases 520-548) , As 
seen in TABLE II, a 1.2 kb transcript is detected in 
heart and, less abundantly, in liver and lung RNA. 

10 In addition, a 2 kb transcript is detected in 

placental RNA. This suggests that the expression of 
HPIiA2-10 is not only tissue specific, but that 
alternative forms of the protein may be expressed in 
different tissues. The 2 kb transcript seen in 

15 placental RNA may result from the use of a different 

promoter, alternative splicing or the use of an 
alternative poly A site. 

The HPIiA2-10 cDNA encodes a mature protein 
of about 118 amino acids with a calculated molecular 

20 mass of about 13,592 Daltons. The amino acid 

sequence has significant homology to known PLA2S. 
All of the 18 invariantly conserved amino acids in 
known active low molecular weight PLA2S, see 
Davidson, F.F. : J. Mol. Evolution . 31:228-238 (1990), 

25 are conseirved in this novel protein. See FIG. 22. 

However, HPLA2-IO contains neither the disulfide 
bridge between Gys 11 and 77 nor the elapid loop 
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characteristic of Type I PLA2S. HPLA2-IO does, 
however, c ntain a single serine amino acid 
COOH-terminal extension, as shown in FIG. 22, which 
is more characteristic of a Type I than Type II 
PI1A2 . As depicted in FIG. 22, Human Type I has a two 
amino acid COOH-terminal extension whereas Human Type 
II has a seven amino acid COOH- terminal extension* 
Furthermore, unlike mammalian Types I and II PLA2S 
which have 14 cysteine residues, this putative HPLA2 
only has 12. The overall homology between HPLA2-IO 
and a consensus Type I PLA2 is about 3 0.5% while the 
overall homology between HPIiA2'"^0 ^ consensus 

Type II PLA2 is about 40.6%. The predicted 
isoelectric point (pl) of this protein is about 6.2 
while that of other known Type II PI1A2S is about 10.5. 

To test whether this HPLA2-10 gene encodes 
an active, secreted PIA2 , an Epstein Barr virus-based 
expression vector (pCEP) is used to express the 
HPLA2-IO cDNA in human 2 9 3s cells. pCEP contains two 
regions of the EBV genome required for episomal 
maintenance (EBNA-1 and OriP) , a drug resistance gene 
for selection in human cells (hyg) , bacterial 
sequences for maintenance in E. coli, a drug 
resistance gene for selection in E. coli (amp) , and 
an expression cassette for the production of high 
levels of mRNA from an introduced sequence by using 
an Rous/Sarcoma virus long terminal repeat (RSV LTR) 
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promoter and an Simian virus 4 0 (SV40) 
polyadenylation signal. HPIA2-IO-5', a 5' primer 
beginning at base 12 6 of HPLA2-IO and containing a 10 
nucleotide Nhel linker at its 5' end, and 
HPLA2-IO-3', a 3' primer ending at base 555 and 
beginning with a 10 nucleotide Xhol linker, are used 
for reverse-transcriptase-polymerase chain reaction 
(RT-PCR) of total human brain RNA to generate the 
appropriate cDNA insert. The PGR product is 
blunt-end ligated to HincII-digested pUC19 and 
sequenced. The insert is then released by digestion 
with Nhel and Xhol and is cloned into the Nhel -Xhol 
sites of pCEP. The resulting plasmid is designated 
pChlO. 

A known human Type II PLA2 cDNA is cloned 
into pCEP for use as a positive control. PGR primers 
RASF-5' and RASF-3' are used to amplify bases 130 to 
581 of pRASF, a plasmid containing the entire human 
known PLA2 Type II cDNA. See Seilhamer, J.J. : 

Biol . Chenu, 264:5335-5338 (1989). The resulting 

plasmid is designated pRASF and is used as a 
control. The HPLA2-2B (Type II) enzyme, as depicted 
in FIGS. 25 and 26, are expressed by pRASF and used 
as a control. 

Purified plasmid DNA is transfected into 
human 29 3 s cells which are selected in DMEM 
containing 200 ug/ml hygromycin. Medium samples from 
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multiple cell lines transfected with either pCHlO, 
pR8-3' or pRASF are then assayed for PLAj activity. 
See FIG. 21. PLA2 activities derived from cell lines 
transfected with plasmids pCHlO, pR8-3', and pRASF 
5 are accumulated in the medixim. Neither 293s cells 

nor multiple cell lines transfected with an unrelated 
PIA2 cDNA inactivated by a one base pair deletion at 
the 5' end of the mature protein show detectable PLA2 
activity in the medium even after 72 hours. Cell 
10 lysates that are prepared by sonication from cells 

stably transfected with either pCHlO or pRASF show 
approximately 50% of the activity of 72 hour medium 
samples. 

Two cell lines, CpCHlO-lD expressing pCHlO 
15 and CPRASF-2B expressing pRASF, are chosen for 

comparative study. The pH profile for the enzyme 
expressed by the cell lines is shown in FIG. 25. 
PLA2 activity of HPIA2-10 starts at about pH 5 and 
significant activity is reached at between about pH 
6.5 and about pH 7 . 5 and remains relatively steady up 
to at least about pH 9.5, whereas the control Type II 
PLA2 reaches peak activity at between about pH 7.0 
and about pH 7 . 5 and then progressively declines. 

Calcium concentration versus enzyme 
activity profiles for CpCHlO-lD and CpRASF-2B are 
shown in FIG. 26. HPLA2-IO appears to be a 
calcium-dependent PLA2 having activity starting at 
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about 0.07 mM Ca^^ amd reaching maximal activity at 
between about 7 mM and about 100 mM Ca^"*"- The 
activity of HPIiA2-10 then slowly decreases, but 
maintains significant activity, as the Ca^"^ 
concentration approaches about 500 mM or more. This 
profile differs from that of the control cell line 
CpRASF-2 (Type II PIiA2) which shows maximal activity 
at between about 0.5 mM and 3.0 mM Ca^"*" and becomes 
inactive at Ca^"*" concentrations at about 100 mM or 
greater. Since HPLA2"^0 expresses at least half of 
its maximal activity at Ca^"*" concentrations between 1 
and 100 mM, similar to previously described Type II 
phospholipases, see Marshall: Biochemical 

Pharmacology . V. 44:1849-1858 (1992), it is likely 
that HPIiA2-10 is capable of functioning at 
concentrations found intracellularly (0.1 to 2 ^M) 
and extracellularly (ImM) . 
RPIA 2 *Q 

Two PGR primers, Pla8-1 and Pla8-2 (FIG. 
3) , are generated using the reported rat r8 
presumptive exon II sequence. See Seilhamer, J.J. et 
al.: J* Ce ll. Biochem. . 39:327-337 (1989). Four 
size-fractionated, newborn rat brain cDNA X2APII 
libraries (two 0.5-1. 5kb, one 1.5-4kb, and one 
greater than 43cb, provided by Dr. L. Yu, Indiana 
School of Medicine, are directly amplified by PGR, 
See Friedman, K.D. et al.: Nucleic Acids Research ; 
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16:8718 (1988), using primers pla8-l and pla-2- Only 
the >4 kb insert library gives the proper size 120 bp 
fragment prediced by the Clo8 DNA sequence. The band 
is purified from an agarose gel using a QIAEX gel 
5 extraction kit (QIAGEN) , cloned into ml3mpl8, and is 

sequenced using a Sequenase kit (USB) . The sequence 
data confirms the proper identity of the PGR 
product. A total of 10^ individual clones from the 
cDNA library are screened using the PGR product as a 

10 probe. Only two clones hybridize. The restriction 

maps of the two clones are believed to be identical. 
One of them, clo8-2, is secjuenced completely. The 
sequence, designated RPIA2-8, is shown in FIG. 3. 

RPI4A2-8 is a 4.4kb cDNA, which is about 

15 five-times larger than any known mammalian 14kDa PIiA2 

cDNA. See Seilhamer, J.J. et al . : DNA . 5:519-5.27 
(1986); Seilhamer, J.J. et al.: J . Biol , Chem. . 
264:5335-5338 (1989); Ohara, O. et al . : Proc. Natl. 
Acad. Sciences U.S.A. . 86:6883-6887 (1989); Kramer, 

20 R.M. et al.: J. Biol. Ghem. . 264:5768-5775 (1989); 

and Komada, M. et al.: J. Biochem. . 106:54 5-547 
(1989) . The 480 bp coding region is believed to be 
contained in a putative 1.2kb loop flanked by 121 bp 
perfect inverted repeats. See FIG. 2. This 

2 5 stem-loop is -flanked by perfect 121 bp inverted 

repeats. This stem-loop structure leaves about 3kb 
of 3' "tail." See FIGS. 1 and 2. Translation of 
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RNAs containing such a secondary structure cannot 
readily be explained by the conventional translation 
scanning model. See Pain, V.M. : Biochemistry J, . 
235:625-637 (1986). Nevertheless, it is believed 
that there is an internal ribosome binding site 
between the 5' repeat sequence and ATG translation 
start site. Cloning the sequence between base 116 and 
72 0, see FIG. 3, in both normal and reverse 
orientations in front of an internal luciferase gene 
which lies downstream of a CAT gene, see Macejjak, 
D.G- et al.: Nature . 353:90-94 (1991), see FIG. 20, 
followed by detecting luciferase gene expression in 
transfected Hela cells (with positive and negative 
control constructs) , confirms that the fragment does 
contain a internal ribosome binding sequence. 
Luciferase expression is significantly higher when 
the fragment is cloned in normal orientation then in 
reverse orientation. It is believed that the 
translation of mRNAs initiated by an internal 
ribosome binding mechanism may play an important role 
in mitosis, meiosis or specific viral infection, 
because cap-dependent translation during mitosis in 
mammalian cells is unlikely, due to the presence of 
underphosphorylated and therefore nonfunctional 
translation initiation factor, eif-4F. See Macejjak, 
D.G. et al.: Nature, 353:90-94 (1991). It is 
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therefore believed that the RPLA2-8 gene product 
could play a role during these processes. 

As a preliminary study, the pattern of 
RPIiA2-8 gene expression, see TABLE III/ is examined 
by screening rat tissues with reverse transcription 
followed by PGR (RT-PCR) , using primers pla8-l and 
pla8-2. See FIG. 3. 



TABLE III 



Reverse Transcription-PCR (RT-PCR) of Total RNA of 
Different Rat tissues by Primers CI08-I and Clo8-la 



1. 


Brain 




2. 


Cerebellum, Brain Stem 




3 . 


Kidney . 




4 . 


Lung 




5. 


Heart 


+ 


6. 


Muscle (?) 




7. 


Pancreas 




8. 


Small intestine 




9. 


Liver 




10. 


Prostate 




11. 


Bladder 




12. 


Spleen 




13 . 


Adrenal 




14 . 


Submaxil 1 ary 





In addition, to determine transcription patterns of 
RPLA2-8 and RPLA2-IO, a Rat Multiple Northern Blot 
(Clontech, Inc.) is probed with a 489 bp fragment, 
i.e., RPLA2-8 PCR probe, generated by PCR with 
primers RCI08-5' (bases 716-742) and Rclo8-3' (bases 
1178-1205) . A rat Multiple Northern Blot (Clontech, 
Inc.) is also probed with a 427 bp fragment, i.e., 
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RPIiA2*10 PGR probe, and amplified using primers 
RclolO-5' (bases 226-253) and RclolO-3 ' (bases 
627-653) • As seen in TABLE II, an RPIA2-8 mRNA is 
detected in testis only and an RPIiA2-10 mRNA is 
detected in heart and perhaps lung only. 

In order to determine the exon- intron 
junction sites and confirm the 121 bp direct repeat 
sequence in the genomic DNA, a 15 kb rat genomic DNA 
clone containing RPLA2-8 coding exon II is analyzed 
by Southern blot, and partial sequencing. The 15 kb 
genomic DNA structure is shown in FIG. 4. It does not 
contain exon I and the 5' 121 bp repeat, but it does 
contain the 3' 121 bp repeat. To further investigate 
the 5' rat genomic DNA sequence, a cosmid genomic DNA 
library (Clontech, Inc.) is screened using a 
PCR-generated fragment containing RPIiA2-8 exon 
I-intron I-exon II. Twelve positive clones are 
indentified. Restriction mapping indicates that all 
clones (about 40 kb each) are identical. 
Unfortunately, the cosmid clones could not contain 
the 5' 121 bp repeat because their 5' ends are 
located in intron I. Thus, RT-PCR is used to confirm 
the presence of the 5' 121 bp direct repeat sequence. 
Pla8-7, a 22 bp 5' primer starting at base 73, which 
lies within the 121 bp repeat sequence and pla8-8, a 
22 bp 3' primer ending at base 212, see FIG. 3, are 
generated to conduct RT-PCR of rat brain total RNA. 
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The resulting RT-PCR fragment is purified from the 
agrose gel and cloned into ml3mpl8, and the sequence 
is confirmed to be as predicted by the cDNA, 

To test whether this PIiA2-8 gene encodes an 
active, secreted PIiA2, an Epstein Barr virus-based 
expression vector (pCEP) is used to express the 
RPI*A2"^ cDNA in human 293s cells- pCEP contains two 
regions of the EBV genome required for episomal 
maintenance (EBNA-1 and OriP) , a drug resistance gene 
for selection in human cells (hyg) , bacterial 
sequences for maintenance in E. coli, a drug 
resistance gene for selection in E. coli (amp) , and 
an expression cassette for the production of high 
levels of mRNA from an introduced sequence by using 
an Rous/Sarcoma virus long terminal repeat (RSV LTR) 
promoter and an Simian virus 4 0 (SV4 0) 
polyadenylation signal. pR8-3', a chimeric 

construct, is constructed as follows. RASF-5', a 5' 
primer beginning with a 10 nucleotide Nhel linker 
followed by 22 nucleotides starting at base 130, and 
Ju9 , a 22 nucleotide 3' primer complementary to base 
177 and 198, see Seilhamer, J. et al.: J. Biol. 
Chem. , 264 ; 5335-5338 (1989), are used to PGR amplify 
plasmid pRASF from bases 130 to 198. pRASF contains 
the entire known PLA2 "^YP^ H cDNA. See Seilhamer, 
J. et al.: J. Biol. Chem. . 264:5335-5338 (1989). The 
PGR product is purified and is digested with Nhel 
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plus Ncol. JuRS-ll, a 5' primer with a total length 
of 31 nucleotides, beginning with GCCATGGGA followed 
by base 806 to 827 of RPLA2-8 sequence, see FIG. 3, 
and R8-3', a 3' primer starting with a 10 nucleotide 
Nhel linker at its 5' end, followed by 22 nucleotides 
complementary to RPLA2""® base 1178 to 1200, see FIG. 
3, are used to PGR amplify plasmid RPI1A2-8. The PGR 
product is purified and digested with Xhol plus 
Ncol. Both digested PGR products are then ligated 
together into Xhol-Nhel digested pCEP. Sequencing is 
carried out to confirm the nucleotide sequence of 
pR8-3'. CpR8-3 ' is a single clone of cells chosen to 
represent the typical pH optimum and Ca dependence 
of CpR8 transfected 293s cells. The effects of pH 
and calcium concentration on enzyme activity are 
illustrated in FIGS. 23 and 24, respectively, for the 
RPIiA2-8 enzyme (Type III) and are similar, but 
different to the pH and calcium profiles for the 
HPLA2-10" enzyme (Type IV) encoded for by the HPLA2-IO 
cDNA cloned into plasmid cPHlO, as shown in FIGS. 25 
and 26, respectively. In other words, RPIiA2-8 also 
appears to be a pH and calcium-dependent PLA2 enzyme 
having activity starting at about pH 5.5 and having 
significant activity at between about pH 7 and about 
pH 9 and having activity starting at about 0.1 mM 
Ca^"*" and having significant activity at between about 
0.3 mM and about 2 mM Ca^"*", respectively. The 



SUBSTITUTE SHEET (RULE 26) 



PCT/US94/07926 

-39- 

activity of RPLA2-8, however, apparently 

progressively declines at a pH of greater than about 
9 and at a calcium concentration of greater than 
about 2 mM. Nonetheless, FIGS. 2 3-26 illustrat 
phsopholipase activity for the Type III and Type IV 
phospholipase enzymes of the present invention- 
Moreover, FIGS. 23-26 show that the pH and calcium 
profiles for the Type III and Type IV phospholipase 
enzymes of the present invention are different from 
the pH and calcium profiles for phospholipases known 
heretofore. 

It should be appreciated by those skilled 
in the art that the novel PIiA2 Type III and Type- IV 
enzymes described in the instant application may have 
many different potential uses. 

Although both "Type II" soluble PLA2 and 
intracellular membrane-associated PIA2 have been 
shown to mediate many aspects of the inflammatory 
cascade, it may well be that the new PLA2 enzymes may 
also play a role, either by directly functioning to 
liberate arachidonic acid and 2-lysophospholipid, or 
by replacing the functions of the former in tissues 
and/or individuals in which the enzymes may be 
otherwise missing. As such, inhibition of these new 
enzymes by standard strategies known in the art 
(e.g. , crystallography-based rational drug design; 
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antisense; triple helix; monoclonal antibodies) could 
be valuabl in anti- inflammatory therapy. 

Phospholipases A2 are involved in other 
processes vital to sustaining life in humans, 
5 including but not limited to pulmonary surfactant 

turnover, biomembrane maintenance and metabolism, 
various lipid catabolic pathways, platelet activation 
factor metabolism, and sperm-mediated egg 

activation. First, it is possible that certain 

10 diseases present today involve alterations in these 

functions, and could be treated therapeutically with 
exogenously added recombinant PIiA2 or anti-PLA2* 
Second, as new PIiA2-inhibiting anti-inflammatory 
therapeutics are developed, many may exhibit 

15 cross-inhibition with these other new enzymes, 

thereby causing undesired side-effects. Both 
knowledge of the sequence/ structure of these new 
enzymes, and the ability to restore their function 
through addition of the appropriate recombinant 

2 0 enzyme could be of value in reducing such 

side-effects . 

Although these enzymes have been 
characterized as PIiA2 enzymes, they may well have 
other vital enzymatic activities. For example, LCAT 

25 (lecithin-cholesterol acyl transferase) also exhibits 

PLA2 activity. Alternatively, these enzymes may 
function as phospholipases Al, phospholipases B, 
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phospholipases C, lysophopholipases , acyl hydrolases; 
ribonucleases, lipases, or ph sphodiesterases, all of 
which are esterases which resemble phospholipase A2 
in chemical activity. If this is the case, these new 
enzymes could be used to treat defects in a variety 
of metabolic pathways. 

PIiA2 also useful in the food processing 

industry. See Dutilh et al.: J, Sci. Food Aaricul . . 
32:451-458 (1981), and in the preservation of fish, 
see Mazeaud et al.: J. Fish Res. Board Cun. , 
33:1297-1303 (1976). Recombinant forms of the 
instant new PLA2S may be useful to replace natural 
sources of these enzymes. 

RPIiA2-8, by virtue of its specific 
synthesis in rat testis, may play a key role in 
activation during fertilization by sperm. Therefore, 
antagonism of its function may prove useful as a 
specific anti-fertility reagent in pests such as 
rodents . 

HPLA2IO and RPIiA2-10, by virtue of their 
specific synthesis in cardiac tissue, may play a key 
role in cardiac lipid metabolism specific to cardiac 
tissue, and may indicate a specialized new function 
for this enzyme. A major component of heart tissue 
is cardiolipin, and Type IV phospholipase may mediate 
metabolism of this related diphospholipid in this 
organ. Therefore, recombinant forms of the new PLA2S 
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could prove useful in the treatment of disorders 
involving cardiac phospholipid metabolism. 

In addition, the new PIiA2S have been mapped 
into a genetic locus known to be associated with 
Batten's disease (or Neuronal Ceroid Lipfuscinosis; 
NCL) • Since the latter disorder has been shown to 
involve alterations in activity of certain 
phospholipases, see Dawson et al . : Advances in 
Experimental Medicine & Biology . 266:259-270 (1989), 
these new enzymes may be useful as a therapeutic to 
treat the former, and as a diagnostic to detect the 
presence of these genetic abnormalities so that 
proper counseling and early treatment of the disease 
would be possible. 

Examples of various embodiments of the 

present invention will now be further illustrated 

with reference to the following Examples. 

Example I - CpCHlO-lD Cell Line Trans fected 
with pCHIO which Expresses HPLA 2 "1Q 

Total RNA is prepared according to the 

method of Chomcyzmski and Sacchi: Analytical 

Biochemistrv . 162:156-159 (1987). 5' and 3' RACE-RT 

PGR techniques are used to generate the full length 

cDNA from total human brain RNA as described by 

Ishisaki: Biochem. Biophvsic. res . Comm. , 

162:1030-1036 (1989), and outlined in FIG. 10. PGR 

amplifications are done using 3 0 cycles at 95 'C for 

20 seconds, 60 'G for 20 seconds and 72 'C for 75 
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seconds in 100 lil of buffer containing a final 
concentration of 1.5 mM MgCl2/ 200 liM dNTP, 100 mM 
Tris-HCl, pH 8.3, and 3 units T^aq polymerase. Anchor 
(300 ng) and adaptor (50 ng) primers are used in both 
5' and 3' RACE-RT PCR. Primers HIO-C (300 ^g) and 
HlO-la (300 \ig) are used for 5' RACE-RT PCR. Primers 
HIO-A (300 \xg) and HlO-1 (300 ^g), see FIG. 10, are 
used for 3' RACE-RT PCR. Primer sequences are listed 
in TABLE IV. 

TABLE IV 



Primers Sequences _ 

HIO-A CTGGCTTGGTTCCTGGCTTGTA:^^ 

HlO-1 GCAAGGAGGCTTGCTGGACCTA^j 

HI 0 - 1 a ATCGGTGCCATCCTTGGGGGTT ^ ^ 

HIO-C GCAGAGGATGTTGGGAAAGTAT-*-^ 



HlO-5 • GAATTCGCTAGCCAGAGATGAAAGGCCTCCTCCCACTGGCTTGG 
HlO-3 • CTCGCTCTCGAGGCCCTAGGAGCAGAGGATGTTGGGAAA^^ 
Ancho r GGCCACGCGTCGACTAGTAC ( T) J; ^ ^ 7 
Adaptor GGCCACGCGTCGACTAGTAC^ O 



^-^represents SEQ ID NO: 13:; ^^represents SEQ ID 
NO: 14:; ^^represents SEQ ID NO: 15:; -''^represents SEQ 
ID N0:16:; -^^represents SEQ ID N0:17: ^°represents 
SEQ ID NO: 18:; -^-^ represents SEQ ID NO: 19:; 
^^represents SEQ ID NO:20:. 

6x10^ clones from a human stomach cDNA 
phage library (Clontech, Inc.) and 5x10^ clones from 
a rat genomic DNA cosmid library (Clontech, Inc.) are 
screened according to the procedures provided by 
Clontech Inc. 

A Human Multiple Northern Blot (Clontech, 
Inc.) is hybridized according to the manufacturer's 
directions . 

293s cells (ATCC CRL 1573) are grown in 
Dulb cco's modified Eagle's medium (DMEM) 
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suppleroented with 10% fetal bovine serum. 
Approximately 7.5x10^ cells are transfected with 10 
of purified supercoiled plasmid DNA from either 
5 pCHlO or pRASF to create cell lines of the type 

CpCHlO-lD and CpRASF-2B, respectively, according to 
the methods of Kingston, R.E. : calcium Phosphate 
Transfection in Current Protocols in Molecular 
Biology, ed. Frederick M. Ausubel et al,, pp. 

10 9,1,1-9.1.3 (1989). Twenty-four hours after 

transfection, 2 00 units per ml of hygromycin is added 
to the medium. Stably-transf ected, 

hygromycin-resistant colonies are selected ten days 
after transfection and are maintained in DMEM 

15 containing 200 units per ml of hygromycin. To test 

for PLA2 activity, 2.0x10^ cells are plated in a 25 
cm^ flask and medium is collected 24, 48 and 72 hours 
after plating. 

Autoclaved [l-^'^C] oleic acid-labeled 

20 Escherichia coli (E. coli) JM109 is prepared 

according to the methods described by Elsbach, P. et 
al.: Methods in Enzvmoloay , 97:24-31 (1991) for use 
as a PIiA2 substrate. Briefly, 2 0 \il medium is 
incubated for 15 minutes at 37 'C with E. coli 

25 substrate (a mix of 2.5x10® labeled and unlabeled 

bacteria to provide 10,000 cpm) in a total volume of 
250 vil (40 mM Tris/HCl, pH 7.8, 150 mM NaCl, 10 mM 
Ca^"*") . The reaction is stopped by the addition of 
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250 ul ice cold 0-5% (W/V) fatty acid-poor BSA 
(USB) • After incubation on ice for 5 minutes, the 
samples are centrifuged at 10,000 x g for 3 minutes 
and 250 >i 1 of the supernatant containing released 
5 oleic acid is counted in a scintillation 

counter . 

The pH optimiun for human Type IV PIA2 
enzyme activity is determined using 2 0 \il of medium 
diluted to produce approximately 10% substrate 

10 hydrolysis. Sodium acetate buffer (final 

concentration 2 5 mM) is used for the pH range 4-6-5 
and Tris/HCl buffer (final concentration 25 mM) ' for 
the pH range 7-9. See FIG, 25- 

The calcium dependence of the human Type IV 

15 enzyme activity is examined in the calcium 

concentration range 0-400 mM. The buffer solution 
(Tris/HCl, pH 7.5, final concentration 25 mM) is 
prepared with doubly distilled, deionized water which 
contained a minimal amount of metal ions. EDTA (300 

2 0 mcM) is added to the assay mixture in order to 

chelate the residual calcium. 20 \l1 of medium is 
diluted to produce 10% substrate hydrolysis. See 
FIG. 26. 

Example II - CpR8-3'Cell Line Transfected 
25 With PCR8 Which Epxresses RPIA2 -8 

293s cells (ATCC CRL 1573) are grown in 

Dulbecco's modified Eagle's medium (DMEM) 

supplemented with 10% fetal bovine serum. 
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Approximately 7.5x10^ cells are transfected with 10 
of purified supercoiled plasmid DNA from pR8-3' to 
create a cell line of the type CpR8-3' according to 
the methods of Kingston, R.E.: Calcium Phosphate 
Transfection in Current Protocols in Molecular 
Biology, ed, Frederick M. Ausubel et al . / pp. 
9.1.1-9.1.3 (1989). Twenty-four hours after 

transfection, 200 units per ml of hygromycin is added 
to the medium. stably-transf ected, 

hygromycin-resistant colonies are selected ten days 
after transfection and are maintained in DMEM 
containing 2 00 units per ml of hygromycin- To test 
for PIA2 activity, 2.0x10^ cells are plated in a 25 
cm^ flask and medium is collected 24, 48 and 72 hours 
after plating. 

Autoclaved [1-^^C] oleic acid-labeled 
Escherichia coli (E. coli) JM109 is prepared 
according to the methods described by Elsbach, P. et 
al.: Methods in Enzvmoloav , 97:24-31 (1991) for use 
as a PLA2 substrate. Briefly, 2 0 |il medium is 
incubated for 15 minutes at 37 ''C with E. coli 
substrate (a mix of 2.5x10® labeled and unlabeled 
bacteria to provide 10,000 cpm) in a total volume of 
250 }xl (40 mM Tris/HCl , pH 7.8, 150 mM NaCl , 10 mM 
Ca^"*") . The reaction is stopped by the addition of 
2 50 ^l ice cold 0.5% (W/V) fatty acid-poor BSA 
(USB) . After incubation on ic for 5 minutes, the 
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sainples are centrifuged at 10,000 x g for 3 minutes 
and 250 \x 1 of the supernatant containing released 
(1-l^C] oleic acid is counted in a scintillation 
counter . 

5 The pH optimxim for human Type III PLA2 

enzyme activity is determined using 2 0 ^l of medium 
diluted to produce approximately 10% substrate 
hydrolysis. Sodium acetate buffer (final 

concentration 2 5 mM) is used for the pH range 4-6.5 

10 and Tris/HCl buffer (final concentration 25 mM) for 

the pH range 7-9. See FIG. 23. 

The calcium dependence of the human Type 
III enzyme activity is examined in the calcium 
concentration range O-400 mM. The buffer solution 

15 (Tris/HCl, pH 7.5, final concentration 2 5 mM) is 

prepared with doubly distilled, deionized water which 
contained a minimal amount of metal ions. EDTA (3 00 
mcM) is added to the assay mixture in order to 
chelate 'the residual calcium. 20 ^1 of medium is 

2 0 diluted to produce 10% substrate hydrolysis. See 

FIG. 24. 

Example III - PIA o Activity 

7.5 X 10^ 293s cells are transfected with 
10 ug of supercoiled plasmid DNA according to the 
25 method of Kingston, R.E. : Calcium Phosphate 

Transfection in Current Protocols in Molecular 
Biology. ed. Frederick M. Ausubel et al., pp. 
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9.1.1-9.1,3 (1989), Hygromycin-resistant colonies 
are selected 10 days after transfection and ar 
maintained in DMEM containing 2 00 units of 
hygromycin. CpCHlO-lB, CpCHlO-lC, CpCHlO-lD and 
5 CpCH10-2G are independent, hygromycin-resistant cell 

lines transfected with pCHlO, a plasmid containing 
the human Type IV cDNA; CpRASF-2B is a 

hygromycin-resistant cell line transfected with 
pMCHS, a plasmid containing the known Type II PIiA2 

10 gene. CpR8-3' is a hygromycin-resistant cell line 

transfected with pR8-3', a plasmid containing the rat 
Type III PLA2 cDNA. These cell lines are tested two 
months after their stable transfection. Each has 
been maintained and subcloned in 

15 hygromycin-containing medixam. For this experiment, 

exponentially growing cells are plated at 4 x 10^ 
cells per ml. Medium samples are taken 24, 48 and 72 
hours after plating. 2 0 pi of each medium sample is 
assayed under standard conditions, see Elsbach, P. et 

20 al.: Methods in Enzvmoloav . 197:24-31 (1991) for PLA2 

activity. Activity is expressed as a fraction of 
autoclaved [1-^^C] oleic acid labeled E. coli yi090 
incubated alone. See FIG. 21. 

Example IV - Searching for human cDNA and Genomic 
2 5 DNA Sequences homologous to RPIA o -S 

Two primers, clo8-4 and clo8-5, synthesized 

according the published human h8 presumptive exon II 

sequence, Seilhamer, J.J.: J. of Cellular 
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Biochemistrv . 39:327-329 (1989), are used in a PGR 
amplification screen of human child brain, adult 
brain, liver, heart, and various white cell cDNA 
libraries. No PGR amplification is obtained from any 
of them. Two overlapping human genomic DNA clones, 
clone 8 and walk 9, containing 10 kb of DNA 5' of h8 
exon II and 16 kb of DNA 3' of h8 exon II, 
respectively, are then analyzed. Southern blot 
analysis using the PGR fragment containing the 
RPLA2-8 open reading frame DNA sequence as a probe 
identified two EcoRI fragments, one in clone 8 and 
one in walk 9 . These two fragments are subcloned into 
pUC19 and sequenced. DNA secjuence homology between 
these sequences and the RPIiA2-8 cDNA indicated that 
one fragment contains a region homologous to RPLA2--8 
exons I and II, and that the other fragment contains 
a region homologous to RPIiA2-8 exon IV. See FIG.- 16. 
In order to search for exon III of a human RPIiA2-8 
homologue, the entire region between exon II and exon 
IV is sequenced. No region homologous to RPLA2-8 
coding exon III is found by computer analysis of this 
sequence. To determine if the HPIiA2-8 sequence is 
transcribed, two primers, one in coding exon II and 
one in exon IV, are used to do RT-PGR of human brain 
and lymphoblast total RNA. No PGR amplification 
signal is obtained. 
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Example V - Phospholipase A2 assay using aut - 
claved Jahio^l^ d bacrte ^^^T^^ a s a stib s1:rat:e 

Autoclaved oleic acid-labeled E.coli 

1-^^C 109 is prepared according to the methods 

5 described by Elsbach: P. et al.: Methods in 

Enzvmoloav , 197:24-31 (1991) for use as the PLA2 

substrate. Commercial porcine pancreatic PLA2 (Sigma) 

is used for the standand assay. Simply, the serialy 

diluted PLA2 solutions are incubated for 15 minutes 

10 at 37 "C with E.coli substrate (a mix of 2-5x10^ 

labeled and unlabled bacteria to provide 10,000 cpm) 
in a total volume of 250 ul (40mM Tris/HCl, pH 7.8, 
lOmM Ca"^^) . The reaction is stopped by the addition 
of . 250 ul ice cold 0-5% (W/V) fatty acid-poor BSA 

15 (USB) . After incubatation on ice for 5 minutes, the 

samples are centrifuged at 10,000 x g for 2 minutes, 
and 250 ul of the supernatant containing released 
[1~^^C] oleic acid is counted in a scintillation 
counter. 
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SEQUENCE LISTING 



CD GENERAL INFORMATION: 

(i) APPLICANT: Tischfield, Jay A. 

Seilhaaer, Jeffrey J, 

(ii) TITLE OF INVENTION: Mammalian Phospholipase A2 Nucleotide 
Sequences and Low Molecular Weight Amino Acid Sequences 
Encoded Thereby, Antisense Sequences and Nucleotide 
Sec[uences Having Internal Ribosome Binding Sites 

(iii) NUMBER OF SEQUENCES: 44 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Ruden, Bamett, McClosky, Smith, Schuster & 

Russell PA 

(B) STREET: 200 East Broward Boulevard 

(C) CITY: Fort Lauderdale 

(D) STATE: FL 

(E) COUNTRY: USA 

(F) ZIP: 33301 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

CD) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/097,354 

(B) FILING DATE: 26-JUL-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Manso, Peter J. 

(B) REGISTRATION NUMBER: 32,264 

(C) REFERENCE/ DOCKET NUMBER: IN2 1044-5 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 305-527-2498 

(B) TELEFAX: 305-764-4996 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 auaino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Lys Leu Leu Val Leu Ala Val Leu Leu Tbr Val Ala Ala Ala 
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15 10 15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Ser Gly lie Ser Pro Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Val Trp Gin Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Thr Leu Leu Leu Ala Val lie Met lie Phe Gly Leu Leu Gin 
^5 10 15 

Ala His Gly 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 euttino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asn Leu Val Asn Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asp Leu Leu Val Ser Ser Gly Met Lys Gly lie Ala Val Phe Leu 
15 10 15 

Val Phe lie Phe Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Trp Thr Thr Ser Thr Leu Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8; 

Ser Phe Trp Gin Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 auaino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Lys Gly Leu Leu Pro Leu Ala Trp Phe Leu Ala Cys Ser Val Pro 
15 10 15 

Ala Val Gin Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Leu Leu Asp Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Lys Arg Leu Leu Thr Leu Ala Trp Ph Leu Ala Cys Ser Val Pro 
IS 10 15 



SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9502326^* i ,!_> 



wo 95/02328 



PCT/US94/07926 



-55- 



Ala Val Pro Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Gly Leu Leu Glu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CTGGCTTGGT TCCTGGCTTG TA 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCAAGGAGGC TTGCTGGACC TA 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) HOLECXJliE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATCGGTGCCA TCCTTGGGGG TT 22 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GCAGAGGATG TTGGGAAAGT AT 22 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GAATTCGCTA GCCAGAGATG AAAGGCCTCC TCCCACTGGC TTGG 44 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTCGCTCTCG AGGCCCTAGG AGCAGAGGAT GTTGGGAAA 39 
(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDKESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGCCACGCGT CGACTAGTAC T 21 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GGCCACGCGT CGACTAGTAC 20 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 722.. 1195 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GAATTCCGCC TCCACCTCTC AAATGCTGGG ATTGCAGGAT GTCCCCCCAC CCCTGCTCCC 60 

TTGTGTCCTT GCTTCCTGCT GCCGGAATGT ATCACTTAAT TGCCAGGTAC CCATGGTCTG 120 

ATTCCAGGAT AGAAGGGCGG GCGAGGGGGT TGGAGGAGAG GCCTCTATTA TTTCCGCGGT 180 

CTGGCAGGCC TGGAAGCAAA GCTTCAAGTG CAGAAGGAGG AGTGTCGGGG AATGGCAGAA 24 0 

AAGGCTGGAA CAGCAATGCA GACCTAGGTA AAGGGCACAG AGCTGAGGGA AGCTCCTGGG 300 

AGGCTCCCTG CAGCTCCTGC CTCTGCACAT GACCCGGACT CCTTTTCTCT CTTTGGATCT 360 

GCGTCCAGGG ACTGGCTTGT ACACACCCCT CCCAGGAGAC CCCTTGGCAG CTGCACACTC 42 0 
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AGGCTCCATC CAAGTTGGCT CTGCCCCTGG GGAAGGCTGC TCAAAAGGCC TGGCTCCCAG 480 

TTTCTGGGGA CCCACAGAGA GCCTCTCACC TCGCAGCTCA GCTCCATCCG CCTCCTGTGC 540 

CTGGCTGCGA CCAGCTGGGT CTAACTATAG ACAGTCAGCA ACTTCAGCCA CTTCACCGAG 600 

TTTCCCAACA GCTTTGAGAT TTGGAAGCCG GAAGCCTGAT CGCCTTCTCA GAAGCTACGG 660 

TCCACTACCT CAGCCATTCT GTTGGAGCTG AACTGGCAGA TGAAGGTGAG ACCCAGGCAC 720 

C ATG GAC CTC CTG GTC TCC TCA GGA ATG AAG GGC ATC GCT GTC TTC 766 
Met Asp Leu Leu Val Ser Ser Gly Met Lys Gly lie Ala Val Phe 
^5 10 15 

CTT GTC TTT ATC TTC TGC TGG ACA ACC TCC ACC CTC AGC AGC TTC TGG 814 
Leu Val Phe He Phe Cys Trp Thr Thr Ser Thr Leu Ser Ser Phe Trp 
20 25 30 

CAG TTC CAG AGG ATG GTC AAA CAC ATC ACG GGG CGC AGC GCC TTC TTC 862 
Gin Phe Gin Arg Met Val Lys His He Thr Gly Arg Ser Ala Phe Phe 
35 40 45 

TCC TAT TAC GGA TAT GGC TGC TAC TGT GGG CTT GGG GGC CGA GGG ATC 910 
Ser Tyr Tyr Gly Tyr Gly Cys Tyr Cys Gly Leu Gly Gly Arg Gly He 
50 55 60 

CCT GTG GAG GCC ACA GAC AGG TGC TGC TGG GCT CAT GAC TGT TGC TAC 958 
Pro Val Asp Ala Thr Asp Arg Cys Cys Trp Ala His Asp Cys Cys Tyr 
65 70 75 

CAC AAG CTT AAG GAA TAT GGC TGC CAG CCC ATC TTG AAT GCC TAT CAG 1006 
His Lys Leu Lys Glu Tyr Gly Cys Gin Pro He Leu Asn Ala Tyr Gin 
SO 85 90 95 

TTT GCG ATT GTC AAC GGG ACC GTG ACC TGT GGA TGC ACC ATG GGT GGC 1054 
Phe Ala He Val Asn Gly Thr Val Thr Cys Gly Cys Thr Met Gly Gly 
100 105 110 

GGC TGC TTG TGC GGG CAG AAA GCC TGT GAG TGT GAC AAA CTG TCT GTG 1102 
Gly Cys Leu Cys Gly Gin Lys Ala Cys Glu Cys Asp Lys Leu Ser Val 
115 120 125 

TAC TGC TTC AAG GAG AAC CTG GCC ACC TAC GAG AAA ACT TTC AAG CAG 1150 
Tyr Cys Phe Lys Glu Asn Leu Ala Thr Tyr Glu Lys Thr Phe Lys Gin 
130 135 140 

CTC TTC CCC ACC AGG CCC CAG TGT GGC AGG GAC AAA CTC CAT TGC 1195 
Leu Phe Pro Thr Arg Pro Gin Cys Gly Arg Asp Lys Leu His Cys 
145 150 155 

TAGGCCTTCC CCTCCAAGAG TCCCCAGGCT CCTGCAGCTC AGCCTTGCTG TCTAGGGAGT 12 5 S 

GTCTTCTCAG GCATTAGGGG ACCGGAGGTG GAGAATTCCT GCCCTGGAAT CAGACCATGG 1315 

GTACCTGGCA ATTAAGTGAT ACATTCCGGC AGCAGGAAGC AAGGACACAA GGGAGCAGGG 1375 

GTGGGGGGAC ATCCTGCAAT CCCAGCATTT GAGAGGTGGA GGCAAGAGGT GGGGGGTAGC 14 35 

CTCCACTATA CGGTAAGTTC AAGGCTAACC TGAGCTACCT GAGACCTTGC CTTGAAAAAA 1495 
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1555 
1615 
1675 



1795 
1855 



2095 
2155 
2215 



CTTTTTTAAA AAACGTTTAA AGGAAAAGAA AACAGAAAGA CACGGGGACT GGGCTGAAAG 
GTACTCTCAA ACCGATTTCC CAGGAAGAGC GGAGAGCCCC AGGTTCAGCT CCAGCCTGAA 
CTCCCCCATA CCCTCAGTCC TGGTCAGGAT GTGTGTCTGA CTGGGGAACC AAGTCCTCCA 
CCCGGGTGGA GCTTAGCTGG GAACTACGCA GGTGTCCTAG AAAATACAGT CCTAAGAGCC 1735 
TCACCCGGAG TCTCATCCCC ATTTGCTCCA GGACTGACCT CTGTAAATCT TCCAGCAGGA 
AGCAGGCTGT ACCCATCTCA GGAGGTGGGG TGCTGTTAGA ACAATGGTGT GCACCAGTGA 
CACAAAGATG TCATGGTTAA GATGGCATCA AGAAGTGGAA AGGACATTCG GAACAGTGGG 1915 
TCCAAGGCAC CCAAAGTCCT CACCCCAATT TAGAAGCCGT TGGTCCTGTA AGACTTAAAT 1975 
CTACTAAACA AGGAAGGTCT AACTGGGCTG GAATCTGAAG TTCATGGTGC CAGGCTGGGG 2 03 5 

CGGTGGGTGG GGACGTGGCC GTGGCCATGA CCATGATTGC CTCTCTGCAT GGTGACACTT 
GCCTTTTGCA CCCTAGCTCT CAGCACATCT GAAAAGGACA GACTCTCCTG TTCATTCCTT 
GAATCTGAGA CTCTCCTCAC TAATGTAGCA AAAATGGAGG TCTAAAGTGC AGGCTTCAGC 
CTCTGAGGTC CAGGGCAGGA GGAAGCTGGG GCTCAGCCTC CTGGAGGATG AGAGCTTGCC 2275 
GGGTGAGCAT CAGCGACAGC AGACCCTTGG GCTCAGAGAG TCCGCAAGCC TGGGAGAGCC 2335 
TGGCCGAGCC CTGACTGCAG CACACAGAGC CGTGAGCCTC ATACAAGAAG CCACATTTTG 2395 
GGGAAGCTTC AGGGTGGCTG ATTCCACAGC TGTTGGGTTC AGAACGGAAG CCGGGAGCAC 24 55 
TCACTTCAGA TATGGAAGCT TTGTTTTACG AGCGCTTAGC ACCAGTTCAG GATCTGAACT 2515 
TCGTCCTGAC CGGAGAGTCC GTACCACATT TTTATAGGAT GGGAACACAG AGCGAGGGGC 
GTGGAGTAAG CTGTTGAACG ACCGATCATA TTTTGACCTA AGAGGTTAAG TAAGGACGTT 
AACATGGGTG ACTGGGCATT AGTCAGGTCA CCTGGTTTTG GGGTCTTTGA ATCAGCTTTC 
GTGGCCAGGT CCCTTCCTGG ACTTTGGCTC GGAATTTAGA ACGATAAGGG AACGAAGAGG 
TGGGCAAGCT TCGGGCAGTC AGTAAGAGGC AGCACATTCA TGACCTGTGT GCCTTGTTTA 
GATAATGGGA TAAGAGTATC TCCTCTCTTA CACCCCTTAC TGGTTAACAG ACAAACACGA 2875 
GACATCTGAA GAAGCAGGAC AGGAGTTAGG TTCTGGGGCA CAGGAACATG AACTCGGTTT 2935 
TGATCCTGCC GGCAAGGTGG ATCTTGTTCC TGAGAAGGCT GGACTCAGGA AACTTCCTCT 2995 
TAACAAGTTA GTTGATGGCG CTGGTCCTTA GTCACCGATA CTGTCAGGCT CTCAGCTCTT 3055 
GGGCCAGACT TGGCGGCCAT GGGAGTGTGG TCACTTGCCC CGTCCCCTTC TTCCAGGAGG 3115 
TACTGGGGAA AATGGTTGGA TTTGTGGAGT TGTAGGGAAC ACTCATGGCT CCCTTCACTT 3175 
AGTAGGTCAG CTAACATATG TGTATCGAGC CCATACCGTG TGCCATGTGC AGTGCTGAGC 3235 
AGCAGGGAGT CAGAGATTTA AAGACACACA CACAGACTTC AAGTCTGAGA ATTTTGAATC 3295 



2575 
2635 
2695 
2755 
2815 
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CCAGGGAGAA CCGCTGAGAG CCATGGCGCT TCTACCAATG CCAGAGGCTA ACACCCGGAC 3 3 55 

TGAGAAAACT AAGCACGAGG AGACAGCAGG GTCAGCAGAG GGCCTGGGAG CTAGGGCCCT 3415 

GAGCAGTACC TAGTTCAAAT CACAGAGTCG TCTTTCTTCC TCCACCCTAC CCAGGTACAG 3475 

CAAGTAGACA CGGGTGGGGG CAGCGCAGGG ATGCAGGAAC ATTAGGGCAC ACCGATGTGG 3535 

CTAGGCTAAG CTAGAGCATG TTACCTTCTC AGGGGTCCTG TCATGTCAGA GACTGGTTCC 3595 

AACCTGGAAA GATGTCTGAG TGACAGCTGT GGTAGAAGAA GAGAGGCCAG GGTGATATCA 3655 

GCATGAAGGG CTGGATTGCT ATGTGAGATC CAGATCTCTT CTGCCACTGG GGTCAGCTTC 3715 

TACACTGGAA ATAGATGGGC TGCGTTATGG AGGGTGGTGT GAGTCCCTGT CTGCGTTGTG 3775 

CCGGGAATCA GAGCAGAGTG TTAGCGCTGT AAAAGGACAT GCTGGTGTTT GCAGGAAATC 3835 

ATCGATTTCT TGGAAGGGCA GCCATTCATC TACACCAGGG ATTGACTTTA TGCCAGGCTT 3895 

GTGATGAGGG TAGAAAAGTA GAAATTCTGT CCGCTGCAAG GAGCAGTCAG AGGACACAAG 3955 

GACCAAATAG CTTGGGAGTT GCGGAAGTAG GTGTCTGCTG AGGGAGCAGT GACCACTGGG 4015 

GGAAAGGCTC CTTCAAGGAA TTCAGGGACA GGGGTGAGGG CTSACATCTT GCCTGAGACC 4075 

CTAAAGAAGA GAAGGAGTTG AGAGGGCTGA GTATGCTGTG TGGAGCCCCA CCCCCACCCC 4135 

CACCCCCACC CCCACCCCAG GTATATGGAT GGAGGATAAT GCGGGGGTCG GGTTCCTCTC 4195 

AAATCCATCA TCCCACCTTC GAGCTGCTGG CACGGCCTTG CCAGCACAGC CCGATTCTGT 4255 

GTTGACAAAA TACTCGAACG AAATGATTAC ATGCAAATAA AATGCAAGAG GAAAAATCTA 4 315 
AACGGAATTC 

4325 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 

Met Asp Leu Leu Val Ser Ser Gly Met Lys Gly He Ala Val Phe Leu 
1 5 10 



15 



val Phe He Phe Cys Trp Thr Thr Ser Thr Leu Ser Ser Phe Trp Gin 
20 25 30 

Phe Gin Arg Met Val Lys His He Thr Gly Arg Ser Ala Phe Phe Ser 
35 40 45 

Tyr Tyr Gly Tyr Gly Cys Tyr Cys Gly Leu Gly Gly Arg Gly He Pro 
50 55 so 
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Val Asp Ala Thr Asp Arg Cys Cys Trp Ala His Asp cys Cys Tyr His 
65 70 75 80 

Lys Leu Lys Glu Tyr Gly cys Gin Pro lie Leu Asn Ala Tyr Gin Phe 
85 90 95 

Ala He Val Asn Gly Thr Val Thr Cys Gly Cys Thr Met Gly Gly Gly 
100 105 110 

Cys Leu Cys Gly Gin Lys Ala Cys Glu Cys Asp Lys Leu Ser Val Tyr 
115 120 125 

Cys Phe Lys Glu Asn Leu Ala Thr Tyr Glu Lys Thr Phe Lys Gin Leu 
130 135 

Phe Pro Thr Arg Pro Gin Cys Gly Arg Asp Lys Leu His Cys 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
ACCTCAGACC CCCTGGTCTC CTCAGGAATG AAGGTCATTG CCATCCTCAC CCTCCTCCTC 
TTCTGCT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
ACCATGGACC TCCTGGTCTC CTCAGGAATG AAGGGCATCG CTGTCTTCCT TGTCTTTATC 60 
TTCTGCT 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 base pairs 

(B) TYPE: nucleic acid 



60 
67 
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(C) STRANDEDKESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 25: 
TGGTGGCAGC CCCCACCCAC AGCAGTTTCT GGCAGTTTCA GAGGAGGGTC AAACACATCA 
CGGGGCGAAG TGCCTTCITC TCATAITACG GATATGGCTp CTACTGTGGG CTTGGGGATA 
AAGGGATCCC CGTGGATGAC ACTGACAGGT G 
(2) INFORMATION FOR SEQ ID NO: 26: 

(X) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 151 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CAGGGACAAC CTCCACCCTC AGCAGCTTCT GGCAGTTCCA GAGGATGGTC AAACACATCA 
CGGGGCGCAG CGCCTTCTTC TCCTATTACG GATATGGCTG CTACTGTGGG CTTGGGGGCC 
GAGGGATCCC TGTGGACGCC ACAGACAGGT G 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS' 

(A) LENGTH: 170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TAGGTGGATG CACCCTTGGT CCTGGTGCCA GCTGCCACTG CAGGCTGAAG GCCTGTGAGT 
GTGACAAGCA ATCCGTGCAC TGCTTCAAAG AGAGCCTGCC CACCTATGAG AAAAACTTCA 
AGCAGTTCTC CAGCCGGCCC AGGTGTGGCA GACATAAGCC CTGGTGCTAG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 base pairs 

(B) TYPE: nucleic acid 



60 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CAGGTGGATG CACCATGGGT GGCGGCTGCT TGTGCGGGCA GAAAGCCTGT GAGTGTGACA 60 
AACTGTCTGT GTACTGCTTC AAGGAGAACC TGGCCACCTA CGAGAAAACT TTCAAGCAGC 120 
TCTTCCCCAC CAGGCCCCAG TGTGGCAGGG ACAAACTCCA TTGCTAG 167 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 233,. 643 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GAATTCCGGT GGATGGAGGG GGCTGAGCAG GATGTTGACT GGCTATCGTT CATTGAGCAC 60 

TCTCACGATC AGCATCACGC ACGGAATCCA TCCTTCCTGT GTTGCAGCTT GTAGACCCTG 12 0 

ATGCTTGGGC TGCCAGCATA AACGTGGGGA TCCAGACTCT GTCTACCGAG GCTGCCCATA 18 0 

GGGACAGGCC CTGGGAAGAG GAGCTGAGAC CAGGCTAAAA AGAACCCAAG AA ATG 23 5 

Me^ 
1 

AAG CGC CTC CTC ACG CTG GCT TGG TTC CTG GCT TGC AGT GTG CCT GCA 283 
Lys Arg Leu Leu Thr Leu Ala Trp Phe Leu Ala Cys Ser Val Pro Ala 
5 10 15 

GTC CCA GGG GGC TTG CTA GAA CTG AAG TCC ATG ATT GAG AAG GTG ACT 331 
Val Pro Gly Gly Leu Leu Glu Leu Lys Ser Met lie Glu Lys Val Thr 
20 25 30 

GGG AAG AAT GCC GTA AAG AAC TAT GGC TTC TAC GGC TGC TAC TGT GGC 379 
Gly Lys Asn Ala Val Lys Asn Tyr Gly Phe Tyr Gly Cys Tyr Cys Gly 
35 40 45 

TGG GGC GGC CAC GGG ACC CCT AAG GAT GGC ACT GAT TGG TGC TGT CGG 427 
Trp Gly Gly His Gly Thr Pro Lys Asp Gly Thr Asp Trp Cys Cys Arg 
50 55 60 65 

ATG CAC GAC CGT TGT TAT GGG CTA CTG GAG GAG AAA CAC TGT GCC ATC 475 
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Met His Asp Arg Cys Tyr Gly Leu Leu Glu Glu Lys His Cys Ala lie 
70 75 80 

CGG ACC CAG TCC TAT GAC TAG AGA TTC ACA CAG GAC TTA GTC ATC TGC 
Arg Thr Gin Ser Tyr Asp Tyr Arg Phe Thr Gin Asp Leu Val lie Cvs 
85 90 95 ^ 

GAA CAC GAC TCC TTC TGT CCA GTG AGG CTT TGT GCT TGT GAC CGG AAG 
Glu His Asp Ser Phe Cys Pro Val Arg Leu Cys Ala Cys Asp Arg Lvs 
100 105 13^0 

CTG GTC TAC TGC CTG AGG AGA AAC CTC TGC AGT TAC AAC CGT CTT TAC 
Leu Val Tyr Cys Leu Arg Arg Asn Leu Trp Ser Tyr Asn Arg Leu Tyr 
3.15 120 125 

CAG TAT TAC CCC AAC TTC CTC TGC TAATGTCCTC TGTGGGCTCT CGCCGGGAGT 
Gin Tyr Tyr Pro Asn Phe Leu Cys 
130 135 

GCCTCCCACA GTGGCGGCCC CCCTCGGCTG TATTCCTGAT CCGTCCACCC AAGGTCTTGG 
ATCTGCCTTC CTCTGTGTAC CACTGGGCTG GACAGAGCCC AGGGTTACAC CCTACCCTCC 
AGAATCCTAG AGAGGGACTC TGATGTAGAG TCTGCGGACT CTGGATAGCT GAGCCTGCAC 
TTGCAGAATT TGGCGCTGGG CCCCGGAGCT CCCTCAGCTC CAGGCCAGTG TCGTGTTGAC 
TTTCCTTTCA ATTTCTGGAA CCCAACTGCC ATTACCACCC TCCAGAGACC TCTTACTAGA 
GGAGAAGCCA AATTAACTCT ATAAATCTGC CATGTAGCTA TTAAATAAAA CCCATTCACG 
AGGCGAGAAG AACACCACCC CAGCACTCCC TCTGACAGGG CTGGGGTAGG AGTGCCAATG 
CTTCTCTAAC CCCTGAGGCA TCTGTGCACC CTCTAGGATG GAGGTCAGGA AACAGGTGGG 
GGCCTTACAT GCCTTTCATG GTTTGTCTTG AGTTTATTTT CTTAAACCTT AGGGTCTTTC 
AAGCCAGACC TGGAGCTCAA GATTCTTCTG GAGGAAGGTG AGACACAGCC CTATGCCACC 
TTGAGCTCCA GGCTAGAAAG GGACAGCCCC TAGCCCTGGC TTCTGCAACT GTGTGGTCTT 
GAACCTCCGT ATAGTCCGAA TCCCTCTGGC TCTCCTCAAA ATATAAAACA AGCCTCCTTC 
CAATAGCATA TTGGTGCACA CCCCTAATCC CATCACCTGG GAGGAGGAGG CGGCAGGAGC 
ATCAGGAGTT CAAGGCCAGC TCCTGCCCCC TAGCAGGGAT GGTAGGCTGC ATGAGAGTGT 
GTCTCAGAAA GAACCACCTG GTGCGGGTAC AGGGATGCTG GGATTCTGAG ATGTCACTCA 
GTGCGGGAAA AGATTCAAGG AGGGGAACAG ATCAATGGCA GAATGACTGT CTGTGCCGAG 163 3 
TTAAGGGCAC TGAAAATCTC AGCTCATCTA TCGCTTTATA GAAGATAGAG CTTTGGGAGG 1693 
AAGCAAGGCA CTCTACAGTA AAGGAGTGGC CTTTCCAAGG AAGGGTCTAG GCTCCTTCTT 1753 
CTCCAGAACA TGCACAGGAC ATAGGAGATC CATTATTTAG AGACCTTTCG TGTTCGAACG 1813 
TTTTCTCCGG AATTC ^^^^ 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Lys Arg Leu Leu Thr Leu Ala Trp Phe Leu Ala Cys Ser Val Pro 
15 10 X5 

Ala Val Pro Gly Gly Leu Leu Glu Leu Lys Ser Met lie Glu Lys Val 
20 25 30 

Thr Gly Lys Asn Ala Val Lys Asn Tyr Gly Phe Tyr Gly Cys Tyr Cys 
35 40 45 

Gly Trp Gly Gly His Gly Thr Pro Lys Asp Gly Thr Asp Trp Cys cvs 
50 • 55 60 

Arg Met His Asp Arg Cys Tyr Gly Leu Leu Glu Glu Lys His Cys Ala 

70 75 80 

He Arg Thr Gin Ser Tyr Asp Tyr Arg Phe Thr Gin Asp Leu Val He 
85 90 95 

Cys Glu His Asp Ser Phe Cys Pro Val Arg Leu Cys Ala Cys Asp Ara 
100 105 110 

Lys Leu Val Tyr Cys Leu Arg Arg Asn Leu Trp Ser Tyr Asn Arg Leu 
115 120 125 

Tyr Gin Tyr Tyr Pro Asn Phe Leu Cys 
130 135 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1014 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATTOE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 131. ,544 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GGATACCAAT GTTCCGACTG GAGACGGGGA GCCCGCGAGA CCCGGGTCTC CAGGGTCTGC 60 
CCAAGGAAGT TGCTCATGGG AGCAGACCCC TAGAGCAGGA TTTGAGGCCA GGCCAAAGAG 120 
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AACCCCAGAG ATG AAA GGC CTC CTC CCA CTG GCT TGG TTC CTG GCT TGT 
Met Lys Gly Leu Leu Pro Leu Ala Trp Phe Leu Ala Cys 
15 10 

AGT GTG CCT GCT GTG CAA GGA GGC TTG CTG GAC CTA AAA TCA ATG ATC 
Ser Val Pro Ala Val Gin Gly Gly Leu Leu Asp Leu Lys Ser Met lie 
15 20 25 

GAG AAG GTG ACA GGG AAG AAC GCC CTG ACA AAC TAG GGC TTC TAC GGC 
Glu Lys Val Thr Gly Lys Asn Ala Leu Thr Asn Tyr Gly Phe Tyr Glv 
3° 35 40 45 

TGT TAC TGC GGC TGG GGC GGC CGA GGA ACC CCC AAG GAT GGC ACC GAT 
Cys Tyr Cys Gly Trp Gly Gly Arg Gly Thr Pro Lys Asp Gly Thr Asp 
50 55 60 

TGG TGC TGT TGG GCG CAT GAC CAC TGC TAT GGG CGG CTG GAG GAG AAG 
Trp cys cys Trp Ala His Asp His Cys Tyr Gly Arg Leu Glu Glu Lvs 
65 70 75 ^ 

GGC TGC AAC ATT CGC ACA CAG TCC TAC AAA TAC AGA TTC GCG TGG GGC 
Gly Cys Asn lie Arg Thr Gin Ser Tyr Lys Tyr Arg Phe Ala Trp Gly 
°" 85 90 

v^? I^^ '^^C CAT GTC AAC CTC TGT GCC 

Val val Thr Cys Glu Pro Gly Pro Phe Cys His Val Asn Leu Cys Ala 
95 100 X05 

TGT GAC QGG AAG CTC GTC TAC TGC CTC AAG AGA AAC CTA CGG AGC TAC 
cys Asp Arg Lys Leu Val Tyr Cy^'Leu Lys Arg Asn Leu Arg Ser Tyr 

115 120 125 

AAC CCA CAG TAC CAA TAC TTT CCC AAC ATC CTC TGC TCC TAGGCCTCCC 
Asn Pro Gin Tyr Gin Tyr Phe Pro Asn He Leu Cys Ser 
130 135 



CAGCGAGCTC CTCCCAGACC AAGACTTTTG TTCTGTTTTT CTACAACACA GAGTACTGAC 
TCTGCCTGGT TCCTGAGAGA GGCTCCTAAG TCACAGACCT CAGTCTTTCT CGAAGCTTGG 
CGGACCCCCA GGGCCACACT GTACCCTCCA GCGAGTCCCA GGGGAGTGAC TCTGGTCATA 
GGACTTGGTA GGGTCCCAGG GTCCCTAGGC CTCCACTTCT GAGGGCAGCC CCTCTGGTGC 
CAAGAGCTCT CCTCCAACTC AGGGTTGGCT GTGTCTCTTT TCTTCTCTGA AGACAGCGTC 
CTGGCTCCAG TTGGAACACT TTCCTGAGAT GCACTTACTT CTCAGCTTCT GCGATCAGAT 
TATCATCACC ACCACCCTCC AGAGAATTTT ACGCAAGAAG AGCCAAATTG ACTCTCTAAA 
TCTGGTGTAT GGGTATTAAA TAAAATTCAT TCTCAAGGCT 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amin acids 

(B) TYPEi amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Lys Gly Leu Leu Pro Leu Ala Trp Phe Leu Ala Cys Ser Val Pro 
15 10 15 

Ala Val Gin Gly Gly Leu Leu Asp Leu Lys Ser Met lie Glu Lys Val 
20 25 30 

Thr Gly Lys Asn Ala Leu Thr Asn Tyr Gly Phe Tyr Gly Cys Tyr Cys 
35 40 45 

Gly Trp Gly Gly Arg Gly Thr Pro Lys Asp Gly Thr Asp Trp Cys Cys 
50 55 60 

Trp Ala His Asp His Cys Tyr Gly Arg Leu Glu Glu Lys Gly Cys Asn 
65 70 75 80 

lie Arg Thr Gin Ser Tyr Lys Tyr Arg Phe Ala Trp Gly Val Val Thr 
85 90 95 

cys Glu Pro Gly Pro Phe Cys His Val Asn Leu cys Ala Cys Asp Arg 
100 105 110 

Lys Leu Val Tyr cys Leu Lys Arg Asn Leu Arg Ser Tyr Asn Pro Gin 
115 120 125 

Tyr Gin Tyr Phe Pro Asn lie Leu Cys Ser 
130 135 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

AAGCTTTGTG GGATTTCTAT TATGAACAAC ATAGGTGCCT TTCCAACTCG GGAACAGAGG 60 

AAATATGGAC TCCTCAAAAG AAAAAAAGAA GAGATGAAGG GATGATGTTG CCAAAGAAAG 120 

AAATTTGGAA AAAAAAAAAC CAAACCAACA TTTGCACTTT CAAAACCATG GAACCCTTCT 180 

TATTTTTATA TGTTCAGATC TAAATGCCAG AAAGGTTACC ACATTCAAAG GGAATGAGAT 240 

TTGAAAATGA TTTCTTTGAG TCCTCTGCTG AGGTCTTTCC AAGGCACTAC AATTAGGGCT 3 00 

TTGCACCCAA ATACCCTTGC CTCATTTTGG TCATTTTTGT CCTGGAACAG AGGTTCAGCT 360 

GGGAGACCCC TCACACACAG GTGAAGGCGT GGCTGTAGAA CCTCAGACCC CCTGGTCTCC 420 

TCAGGAATGA AGGTCATTGC CATCCTCACC CTCCTCCTCT TCTGCTGTAA GTAGAGAGCG 480 
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TTGGTGGGTC AGCACCAAGC TTCTGTCTTC CTGTTTATGT CAGTGGGAGG GGGGACTCTC 
CAGGTGGCAC CAGGTGAGGG AAGTCACAAG TCCCGCAGAA AAGAATCAGG AAAGGAACGG 
GCTCCCACCA ACGTCCTCTT GCTTCTGTTT CTGCTATAAA ATGSGCTGAT CCCAGTGTTG 
GGATCTTATA AAGTGTCTAG GAAATCAGAG GTTGCCAACC ATTTGCTAGA AAGGGAGTTT 
GAGTAGTATT TTACCCCCCC TCACCCTCAA GA6TCTTTTT ACTTTGGATG CTAGTAGCCT 
TTTATTTAGG CATTGGATCA GAACAAAAAT GCJ^GGACATA TATCCAGCCT AATTTAACCA 
ATGGATTAAA TGGCCTTATC AGGAAAAGAC CATTTTATGG TGACTTATGG GATAATTGGT 
AGTTATAAGT CATTGCTGCC GGGAGATCCG ATTGCTTACC TCTGCAAAGT GAAGAAAGAC 
CTACTGGGAA ACAGTTTGGG GTCTACTGGA GACTGATAGA CTCTTTTGCT GGATTCGTTG 
AGTGGAGGTT TCTCCAGATC CATTTTCCTG TCTCTTTCAA TTGAGTCACA ATAACTTTTG 
AGTCCCTAAG TCAAAGATGT CAAAAACAGA CTTCCTTTCC CCACAGTGAG TGGTGGAATT 
TACACTTTGC AAGGTGATAG TGCAGGAGGA TACCTGTACG CAGGGATGAC CGCCTCTGCA 
GCCCTCAGTG CGGCTCCAGG ACTGCTTGGC CACCAGTGAC CCCCCCATGG GTTTCTTCCG 
CCACACCCCC GTTTAGACTG AACACGATAG GTAGATCGAA GGCCACCTGA GAAAACTCCC 
CCAAAACTCT ATTTCTGTTT CTCTTCTTCA AAGTTCATGT CTTTGTTGTA TTTTTATTGC 
AAATTTACTA CATGCTTATA GTTAAAAAGT AAAATAAATG AGTATATAGC AACAAGGTAA 
AGCTCCTCCT CATCCTCCCC AGACCCCAGT TTTTTCCCTA CATCCAGATG TGACCACTCT 
TAAGAGTTTG ATATACATCC TCTATACAGC GTTTACCACA CACACATTCA AAACACCATA 
ATAGGAAGGG AACACATGCT GGGCCGGGCG CGGTTGTTCA TGACTATAAT CCCAGCACTT 
TGGGAGGCCG AGGCGGGCGG ATCACCTGAG GTCAGGAGTT CGAGACCAGC CTGGCCAGCT 
GGCAACATGG TGAAACCCGT CTCTATTAAA AATACAAAAA ATTAGTCAAG CATGGCAGTT 
GGGCACCTGT AATCCCAGCT ACTCAGGAGG CTGAGGCAGG AGAATTGCCT GAACCCGGGA 
GGCGGAGGTT GCAGTGAGCC GAGATCACAC CATTGCACTC CAGCCTGGGT AACAACAGCG 
AAACTCCGTC TCAAAAAAAA AAAAAAAAGA AGGAAAGGGA CACACGCTTA TTATGAAAGA 
CATGAGACAG CGGAGACGTG TATAAATGAT GTTGCCTGTT TTCTTTCTCT CTCTTCATCC 
ATGCTAGAGA TAGTGCTATC AAATGTAGTT ATTTTTGAGA CACATATTTC GTATTATCCC 
TGTCGTGACA TGTGGGTGGT TTCCAATTTT TTGATATCAC AGATAATGCT TCAGGAAACC 
ATTTTGTGTA TCGATTTGTG CCCACTCTCA TAAGCATCTT GTAGAAGCAA AAACAGCTGA 
GTTCATGTGT ACTTGTCATT TAAAAAAATA ATAATTGAGG ATACCTTTCC TGCCTCTTAA 
GTATTTTGTT TCTCCTGTGA GATAGTAAAG GCCTGATGAC ATCTGGAGGG ACTGGCGTTT 
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CTGGCTTTGA ACTTTTGCCA TTCATGTTGC ATCAGACCCG AGGGTGTTCT GCCTAGAACT 
GTGGTTTCTT GCTTTGAGGG GGAAGACTAT GGTTGATGGG AAAGCCTTGT TCTGAACCTC 
ATGGAAACTG GGTATTCATC TGGGTTAGCA AAAAACTAGC TGTGTTACAG GGGCAAATCT 
GAACCTATTT TATTCCCCAG GAAAGAGGCT GGTGATTCCA GCCATGCCCC TTGCACTTCG 
CTTTGGGGAT CTGGTGATAT TTCGAATGCT CAGCACTCTA GTAAGGGGAG GGGACATCAA 
GGCAGCATCA TGCTCATTGC AACTTCCTTC TTCCTTTTTT TCTCATCGGT GGTGGCAGCC 
CCCACCCACA GCAGTTTCTG GCAGTTTCAG AGGAGGGTCA AACACATCAC' GGGGCGAAGT 
GCCTTCTTCT CATATTACGG ATATGGCTGC TACTGTGGGC TTGGGGATAA AGGGATCCCC 
GTGGATGACA CTGACAGGTG GGTGCAGAGG CTCTAAGGCC ACTTATCATT TGTTTTGCAT 
TAAAGTTCAT GCTCAAAGCC AGAGAGAGGG TCTTAGGATT CTTGCCTGGC AAATAACAGA 
AAACAACTCA GGCTAATGGA AGGAAGAACT GAACGGGATT TGGAGGATGG GTCTTGAGAA 
ACCCAGGGTC GGGGCCAGCT TCTTGAGTGT GTGACCTGTG AAGTTTCACA GGGCCCAACA 
CTCATAAGGG TCAGGGCCAG CTTCTTGAGC GTGTGATCTG TAAAGTTTCA CAGGGCCTGG 
CACTCATAAC CCCCTAAACA TGGTTTACTG CTCTGCTGCC ACATCTTGAA ATTCTTAATA 
.AAGGGCCTCA TGTTTTCATT TTGCTTTACT CTCTGCAATT ATGCCGTTGG TCCTGCCCAG 3180 
AGCTCTAGAA GCTGTTTCAT CCTCATAGTA AAAGTGCTCT GCTTTCAGCT CTCCAGCTTT 324 0 
TAGCACTATA CCCACAGCAC AACTGACTCA CTAGTCCTAA TTCCATATTC TGGAGAGGGC 33 00 
TCCAAAGTGG CCCACTTTGG AGAAGTTGTC CATCTGGGTG AGGTTGCATG GCACAAACCT 33 60 
GGCTTCAGGC CTACTCCAAA GGATGGGGGT GGGGGAGTGT GAGTTCCTAG AAAAAGTAGA 34 20 

GGTGGGTGTC ATCTGGTGAA TGTACGTGTG GGGAGGTAAG AAACGGGACA GTTTGCGTCT 
CAATTCATTT GAAGACATAA GAAAGCAAAA TGTTCCTTGC CACATTTAAG GTAGTATGGA 
GAAACATGTC CCACAGTGGC CTTAAATATC ACTCTGAGCT CGAGTCTTGT GGTGGCTCAT 3600 
GAACCATGGA GGACCTAGAG GTTCGAAGGG CAATTGACGC TTATCAAATG CCCTTATGTG 3660 
CCAAGCACTG GGACTGGCCG ATTGGCATAC AAACCTAATT TAATTCTCGC AGGGAATGCA 3720 
CGACACAGTT GATACCAGCC CATTTGACAG CCTGAGGACA TGTGAGTTGC TAAACCACCT 3780 
CCTAAAGGCA ATGCAGCTTC TAAGTGGCAG AGTTTAGGAT TGAACGAGAA TTTGCCTATT 3840 
TCAAAGTTTG TCCCCTCTCC TTGATGGTCT GTGCCTCCCC TGTCAAAGTC CAAAGGCTGA 3900 
TTAGAAATTG AACATCATTA GCCAAAGCTG ATCAACAGCA GAGCCCCCAC TTGCAGATGG 3 960 
GAATGGTGAG AGAGGGAGAC TGAAACACTT TTTTCTTGGC CTTTCAGGGT TTAGAATCCA 4 020 
AGCTTAAGTT TCTGCCTTCC TGTCCCTTGT GTAGTGGTTG AGGACATGGA CTGAGCCCAT 4080 
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GCTCCAGATG GTATTTCTCC TCCAGTGCTC TCCCATCCAG CCCCCAGCCA ACTCTGGGTG 
CCATGAATGG GACTACGTCG GCTTTTACAG ACAGTTGTCT CCTCAGAGAC CGTTACAGTG 
CCTGACTCAC AGTAGGTGCT CAGTAAAAAG TGTTAAATGA ATGAATGGGC CTAGGTTTGT 
GTCCTGGGTC TATCATTCTC CAGCTGCCTA AGTTTGGGAA ATTGGCCTCT TGGAATCTCA 
GTCCCTCCCC TACAAAAGGG CAGCAATGAT TGTACTTTAT AGTTTCTAGT AGCTAATGAG 
ATAGCAACAG ATACTACAGA GGGCTCAGGA AATGCTACTG GTTATTATTA TTATTTTTTA 
TTTTATTTAT TTTTTGGGAG ACGGGGTCTT GCTCTATTAT CCAGGCCTGG GGTGGAGAGG 
CTCAATCAGA GCTCACTGCA GGTCCTCAAG CAATCCACCC ACTTCACCTC CTGAGTAGCC 
GGGACCACAG GCTGGTGCCA CCATGCCTGG uXilTrrrrr TTTTTTAAAC TTAAAAAACA 
TAGGCGGCTC CCTATGTTGC CCAGGCTGGT CTCAAACTCC TGGACTGAAG CGATCCTCCT 
GCCTTATCCT CACAAAGTGC TGGGATTGCA GGCATGAGCC ACCACACCTG GCCTATGTTT 
AATATTATTG ATAATTCACC TCCTCACCTT CAATGCCTTC TTGCCTAGAG GAGGAGGCAG 
GTGAGCCCTT TCTAGTCCCC AGATAAGGTC CTCCAGCAGA TTCCTGAGGG ACCCACTTCC 
AGGCACAGCC CCTCATCTCC CTCTCCCTAC GAGAAGCTGA AGGAGTTCAG CTGCCAGCCT 
GTGTTGAACA GCTACCAGTT CCACATCGTC AATGGCGCAG TGGTTTGTGA GTAGCCTTTT 
CTGTATGGAA ATGTCTTTTA ACCTGGGCCT TTCCTTAACG TTCACCTCCT CTTTGACCCA 
GAGATCTTTT AGAAAATGAA ATGCTTCCAA GTGCTTGGAA GGAGATATTC CTGAGCITTC 
TCCTGATGCT CCAGAGCTTC TCAGAGTGTC CGTGCTCATC CTGCCCTGGT CTCTCCCACC 
CATGAGTGTA CCTCCTGAAC TCTCTGGGGG CCCAGAGCCT GGCAGATAGT ACATGCTCAG 
TAAATACTTG TTCACTTGAG CTAATCTTGA AGCTTCCCTT GACAACTGCT GCTGTTGAGA 
ACATGTTTCC TTGTTTCTGT GATTTTGTTA ACAAAACGGC TCAGCTGTCT TCCAGTTGGA 
CAAATATTTA TTAAGGGCGA CTGCATGCCA AGCACTAAGA TAGGTGCTGC CAGGGCCACA 
AAAGCAAATA GGTGGGAAGG GAAGGGGGAC TCACATGTTA CTGAGACCAT TCAAGGAGCC 
ATGTGGGCAA GTGGATCAGT GCCCTTCACA TGGGGCGTGG CCTGGCATCC GGAGCGTGTT 
CTGCGGCTGG TAGGGTATGG GTATGTGCAG GGCAATCCTG GCCTAGACAG CAGGCACATT 
TGGAGGCACG GGACAGTAGT CTTTCGTGAG CACCATCCrrr TCCAGCATAG CCAGGGTGGA 
TCCTGGGGTC CTGGGCTGGG AGGGTGAAGA GCAACAAATA AAGAAGTGGC TTCTTGGCCG 
GGCGCGGTGG CTCACGCTTG TAATCCCAGC ACTTTGGGAG GCCGAGGCGG GCGGATCACG 
AGGTCAGGAG ATCGAGACCA TCCTGGCTAA CACGGTGAAA CCCCGTCTCT ACTAAAAATA 
CAAAAAAAAT TAGCCGGGCG TGATGGTGGG CGCCTGTAGT CCCAGCTACT CGGGAGGCTG 
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AGGCAGGAGA ATGGCGTGAA CCCGGGAGGC GGAGCTTGCA GTGAGCCGAG ATTGCGCCAC 
TGCACTCCCG CCTGGGCCAC AGAGCGAGAC TCCGTCTCAA AAAAAAAAAA AAAAAAAAAG 
AAGAAGTGGC TTCTTATAGT GT6TGGCTCA CTTCCTGCCT GGCCTCGTGG GGTTGCATGA 
ATCACTTTCC TTCCCAGGTG TATTTATTCA GAGCTGTGAG TGCACCTTGG AGTTCCTCTG 
TTTCCTCCTG AGGTCyVGGGA ACTACCACCT CTCTGCCACT CATCCCCTAT GGCGGGAGAT 
ACATCCTCCA TCCCGTAGTG GGTTCCAGGG CTCAGAACCC TGGTACTCCT GAGCTCCCCA 
ACCCACCACT TCAGCTCAGC ACACACCAAT ACCCAGAGTT AGGACTGTGA GGTCTCCCTG 
GCACCAGCTG TGTGGGTTGG GGGCTCGSAC CCCTGCACCG GGAGGACCTG CCTCAGCTCT 
TGGCCTGCCC TGCCCACTGC CACCAGCACG TGGTTGACAG GGAAAGAACC CCCTTTTGrr 
CCCCACGTGA GCTCAAGCAA TCCACCCACT TCAGCCTCCT GAGTAGCTGG GATTACAGGT 
GCCCACTGCC ATGCTTGACT AATTTTTTGT ATTTTTAATA GAGACGGGGT TTCACCATCT 
TGGCCAGCTC AGCACACACC AATACCCAGA GTTAGGACTG TGAGGTCTCC CTGGCACCAG 
CTGTGTGGGT TGGGGGCTCG GACCCTGCAC CGGGAGACCT GCCTCAGCTC TTGGACTGCC 
TGCCACTGCC ACCAGCACGT GTTGACAGGG AAAGAACCCC TTTTGTTCCC ACGTGAGCTC 
AAGGAGACTT CCCTGAGTTG GAGCTCTCTG GTGTGGTCCT TCTCAGGCCT AAAGCAAAGT 
GTCTTTTCTG TGACACCTCC AAGGCCATGT TCAGGAGAGG GCAAGGGATC AGGGCCTGGT 
GGGAGGGATG GGGAGAGGGG ACTGGAGAAG GTGGCCTCCA GGGATCGAGT TTCCCATGGC 
CTCTTCCCAC CTGTCTTTGC CACAGGGGTG GGGACACCTG GCTGGCCCAG CCCAAGCCTC 
CACCCTGGGC TCCTGTGGGC TGGCTGCACT CGCCAGGGCT GGCCTAGGCT CTCTGCACCC 
AGGGAAGCTT CTCTATTCAA TGCTCTTCAC CCTCCCAGCC CAGGACCCCA GGAGATGAGG 
GAGAGTGGAG CAAAGGTTGA GGAGCAGAGG CTGGAGCCCC AGGCAGTGGC ACTGCTGGGC 
AGTGGTGGGA GGTGCCAGCC AGGGCTGGGA GTTGGACCCG AAAGTACGTG GCCTGGGCTG 
TACTTTCTTC CCACGTTGCC CCTTCAGAGC AGAAGCAGCC AGTTGCTCCT GAAGCCTTGA 
CCAGGGCTCC TGAGTCCAGA GCCTTGCTCA GGGCACTAGC GTGGGAGGAG GCTTCCGCAT 
CAGTACAGGG CATCAGCACC CGCCTCCTCA GCTGACCCAG CCCCGTGAGG ACCCAGGCCC 
AGCCCCCTGT CATCCCCACC CCCACCTTGC CAAGCCCCTG CCCCCAGGAG CAGGGCTGAG 
AGCGAGGTGA TCTGGGTTCT AATCCAGAGT CTGCTGCTGA CATGTGCTGA GCCCCAGGCC 
CATTGGTTTA CTTGCCCCAG TATTGAGCGA GCATCCACTG GGTACCCGCC CAGTGCCGGT 
GCTGTGCCAG GGGCCGGGGC ACAGAATAAA GCAGACCCGT CCCTGCTCTT CTGGCATTCA 
CAGTCTTGTG GAAACTCCAG ACTGAAAGTG CCCTTAGAGA TTATCCAGAT CAGCCCCTCC 
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TTGTAGCAAT GAAGAGACTG AGACCCACAG AGGGGATGAG TTTGATCCAA 
AGATTAAGAT GCATGTGTCT TGAACCTTTT CAGTGCTCTG GAACATACCG 
GTTGTCTGGG CTTTGGTTTT CCCATCCATG AAATGGGTAC AATAACAACA 
ATGAGCCTCT GTGATAGATG CTGTACGCAC AGCACCTGAA CTCACATGAT 
GGTGAGCATT ATCTCCCATT ATCAAGGAGG ACCCTGGGGC TCAGAGAGGT 
GCCAAGGCCA CACAGCCAGG GAAAGAAGAG TTGGAATTCA AACCCCGGGT 
ACACTAGCTT CCCCTGTGGA GGGTGCTGGT GTGTGCATGA TTGGAGGCCC 
TAAGTCTCAG GATCTGCAGC AAACTGGTCA GAATGCTCTG CCCTGGCCCA 
GAGGGGCAGA TGGAGTTTGC TTCGCTGTAA GGCCCCGGAG CTTTGTGTTC 
GCCTCAGAGT CGGGCAACAC TGGGTCTAAT TCCAGCTCCA CCCCTTGTAT 
GCCTTAATCT CCTCATCTGT AAAATGGAGA GAATCGTCGC CTGTACTTCA 
GGAAGGATTA GCTAAAGCAA CCCAGCTACA GTGGCTGGCC TACAGTAGGT 
TGCCCTTCCT TTTAGATGTG GAAATTCCTC TTTTTGTCCA AGTTTTCTTT 
TTACGGCACT GGGATTTTCT TTATTACTGT TTCTTTGAAG AGTCCGCTCT 
CCACGGCTAT GGTCAGTAAC CCCTTATGGA ATAAAACCCC TTTCCTGGCC 
GCTCATACCT GTAATCCCAG CACTCTGGGA GGCTGAGGCG GGAGGATCAC 
GAGTTCGAGA CCAGCCTGGG CAACACAGTG AGACCCCTGT CTCTACTAAA 
ATTAGCCAGA TGTGGTGGTG CATACCTGTA GTCCCAGCTA CTCAGAAGGC 
GGATCACCTG AGCCCAGGAG ATGAGGCCAC AGTGAGCTGT GATTGCACCA 
GCCTGGGCAA CAGAGTGAGA CCCTACCTCA AAAAGAAAGC AACAACAGAA 
CCTATCCTAA TTGCACCTCC ATTCAAAGAG CTGCCCCTGC AAGAGTTAAC 
GCCTCCCATG AGTTCTGAAA TCCTGCACCC AGGCCTGGTC CCAGTTGCCT 
GGCTGCTCTG GGATGCAGTA GGTAAGCAGG GGAGGGAGAG GAAGAAAACA 
TCCACGACTC TAAATGTCAC TGAGAGATCA GTGCAGAGAA AGGCCTGTCA 
GGGCCCAATT TGCCTGGTGG TAGGGACAGC TGCCCTCAGG CCACCTGGGA 
CCTCCTTTGA GTGGGCTTAC ATAACTACTT GGCATTTTTG CAAGGGACTT 
CAGCAGTGAC ACCCCCCTCC GCCCACATGC ACATACATGT GTGGTACAGG 
TGTGGGAGGC AGAGATGGGG TTCCAGCCAA CTGAAACTCC ATCATCTGCA 
TCTGACTGCC TCCCTCTGCC AAAGCGGGAA GATGAAAATG GTAACTGCTG 
TTTGCAAAGA CTTTTCTCAT TTACTGCTGA ATATATTCCT CATCTCAGCC 



GAAACAGACA 
TCTGGCCGGA 
GCTATAGTGT 
AAACCACTGA 
TAAGCACGAT 
GCCCTGTCTC 
TCACACAGTG 
GGGAAGGAAA 
CTGCTGAGAA 
TAATAGCTGG 
TAAGGCTGCT 
GCTTCATTAA 
TCCTCTTTGC 
GTACTTGTGC 
AGGTGTGGTG 
TTGAGCCCAG 
CATACAAACA 
TGAGATAGGA 
CTGCACTCCA 
AACCTATTTC 
CAACTCCCTA 
AGCAACCGGG 
ACTTGGTCTG 
CCAGAGCCCA 
GGTGGTTATC 
TAAGCTCACT 
GAGGACCCGG 
TCTCCCGGCC 
GAATTTGTAT 
TCCACTCGCT 



7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 
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GACACGCTAC 
CTCTGTGTCT 
GCAAGCCTGT 
GACTAAGCAG 
AGACACATGT 
TGAAATAGGC 
AGGGAAGCAA 
GGCTTCCTGT 
GCAAGGTAGT 
GGACAAAATC 
TTCATTAATA 
ACAGGCTGCA 
GTCGAAGAAG 
AGGAGCAGAG 
TTCTATGTCA 
CCCCAGAAAG 
GGCCTCATTA 
GGTGACCAAT 
ACCTGTCGAT 
CTGAGTGGCT 
GATGAAGACA 
' TAAGGGGTAG 
CTGCAGCGGA 
GAATTCAGAG 
GGCAGGAATG 
AAGTCATTCC 
AAAGATGCAA 
CTTGGCCTGG 
GCCACAGAAG 
GGCAATGGGG 



CCACTGTCTC 
GTGTGCCTCG 
CTTGTTTGAG 
TGTGGGGTCT 
TGGGAACTCA 
AGTGCACGTA 
CTTGCCTGAC 
CTCCAAGGGG 
CAGCAAAGAG 
CTTGCCTGGA 
AAGACACATT 
AAATGACAGC 
GCCTCTCTGG 
GAGCCTTGGG 
ATGGCAAGGG 
ATCACTCTGG 
GGGGACTGTT 
GGATGGAAGT 
GGGTTCAGCG 
GTAGATAGTG 
GCAAGAGTTT 
GCAGGTGGAC 
GAAGTT6GGA 
ACAACCAGGG 
CAGACTTGCT 
AAAGCCCTCC 
GATCTGGAGT 
GGCTGTCTCA 
TGACAGTCCA 
CAGGCCACTG 



TCCCAGCATT 
ACTCTCCCCC 
GGCACTGGTT 
GCTGGCTTGC 
ATACTTGTTT 
TCTTTATCAC 
ACCCCAAATC 
TCAGGTCCAG 
CCTTTACTGC 
AGGAGCTTGT 
CAAACAGGGC 
GTTTGGGGGT 
GGAGGTGGCA 
GAAGAATGGA 
AGTCATTGGA 
CTACAGTGCA 
GCAAAGCACA 
GTCAGATGTC 
TGGGGTGTGA 
GCACCAAATA 
GGTGTGAGTC 
ACGTGCTTAT 
GTATTCAGAG 
CTGAGGCGAG 
GCCTCTTCTT 
TTTCCTTTCT 
TAAGCTTGTT 
TCATCCAGGT 
CTGCAAAAGA 
AAGTAGAACT 



CATCTCTACC 
ACCGACTAGA 
CTCATAGAGC 
ACCTGTGCCC 
GTTTAATGAG 
CATTTGAAAG 
ACAGAAGCAG 
CTGGCATTGG 
ATGTTGGGGT 
GTTCCAAAAA 
CAAGTGCTCT 
CAGGAGACAG 
TTTGGTCTGA 
GATGAGGTTG 
GCATGTGAAG 
GAGAAAGAAG 
GGGAGGCACA 
GCATGCTGTC 
AGGAACACAG 
CTGAGCTCGT 
ACCTTGAGTT 
TGAAGTCTGG 
TTCTGACACT 
GGGCTTAGAC 
ATTTGTGGAG 
TCATGCCTCA 
TTTAAAAGGT 
CATGACAGGC 
CTGCTGCTCC 
GGATGTCAGA 



TGAAATGATC 
AAGGTCCGTG 
CACAGGGAAT 
CCAGCTCCTA 
TAGATGAACA 
CTGAGGAAAC 
CATATTTGGC 
CCTGTAGGCA 
CAGAAGATCA 
GAACAAGAGA 
GAAGCACCTC 
AAGGGTGCCT 
GACCTCAGGG 
GACAGGATGA 
CAGAGGATGC 
AGAGTCAAGG 
ACCACAGCCA 
GGTAGGTCAG 
GCTGCACCCC 
GAAGATGGGG 
TGAGACACGT 
AGCCAAGGGA 
GACCAAGAAC 
TGGGGCCTGG 
ATGTAGTTCA 
GTTTCTCCAT 
GGCCTCCAAA 
CCGGTCCATG 
AGATCAGTTC 
TGCACGCATT 



TTGTTTACTT 
AGAGCAAGGA 
GATGCCCCTG 
GCCAAAGACC 
AAAGCACTCA 
AGGCTTGGAG 
CCAAGAACCT 
TGTGAGTGTG 
GCAATAAGGA 
CCACAGCATA 
AGACAAAGCG 
GCTTTAGGTG 
CCAATGTGCT 
GACACGTGCC 
TCTACTTTTG 
AGGAAAGAAG 
AGATCAGCAT 
GGCCGACAGG 
AGCTCCTGGC 
GAGAGCTGAT 
GTCAGACATG 
GAGGTGTGGG 
ACCCCTCAGA 
GACAGCCACA 
TGCAGCAAGA 
TAGCACATTA 
GACGGTTTTT 
GTTGAGGAAT 
TGGAAGGCCT 
AGAAAGGACA 



9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
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GGAAGACCAA ATGAGAAAGG GAGAGGGGGC 
AGGCAAAGGA AACAAGAGAT GGAATAGAAG 
ACAGAAAGAG AGAGGGACGA GAAAGAAGGT 
GGCGACAGCA ACTGGACCAG GCCCTCTAGT 
CTCAAGTAGC CCTCGTCCCC TGAGAGAGTG 
TCTGGAAGGA TCTTCACAGA GGCAGGAGGG 
TAGAACAGTG GGAGTGGCGG GACCCAAAAC 
AGATTCCCCT GCGGCCGAGC AGGAGGGCAG 
AGTrcCAGGG AGGCGTGGGA GGACGGCTTC 
GGCAAACAGA CTGACTGCAG GCAGCTCTGC 
GGTGGAGCTG GAGTGCATCA CCCTGAGAAC 
CGTGCCAGGC ACATCCGGAC CACTTGTCGG 
GGAATGGGAC AGTGTCATGT GTTTCTGAAA 
AGGAAGCTGA AGCTAACTTG CCCAAGGTCA 
CACTGCCAAC CCTGAGAGCG GACTAGGGTG 
AGAGCCCTAA ACACACATGC GCGCGCACGA 
GCACACACAT GCGCACGCAC ATACACACAC 
AAGCATACAC ATGCACACAG GCACACGCAT 
ACATGTGCAT GCACACAGTG CGACAGCTCT 
AGTGGTGACT CGGCCAAAGT GCAGACACTG 
TCCCTTCTCT TATTCTTCAT TCATGGATTC 
CCTGGATGCT GGGGATACTG TGATGACTTA 
TTCTTTGAGG GGAGGCAGAC ACAAGCCAGG 
CAGTGAGTGA GACAAACGGG TACGGAGGAC 
CAGGGAGCAC CTCTCTGAGG AGGTGAAATT 
CACTGCTTGC AGATCCTGGG GAGAAGCATT 
CCAGAGACAA GACAGAGCAA GACCTGTGAC 
AGGCTGGGAG AGTGGAGGGG GAGGGCGATG 
CGGCAAGTGT GCCTGGCCAC GGAGGCCACG 
AACCACACTG GGCTGTAACA GAGAGTGACG 



-74- 

AGGGAGAAAG 
AAGACAGAGG 
GGCTGAGGAA 
TGGACAGTGA 
GGGGCTACCT 
GGAGTGGGAG 
CGGAGAGCCA 
GAAGCTCAGT 
TGCAGGACGC 
CGGCTCCACA 
CAGCAGCAAG 
TAGACACCAG 
TGACTAGGTT 
TAAACCGGGC 
GAGTTATCTG 
CACACACACA 
ATGCACACAT 
GCACACACGC 
GATTAGTAGG 
AACCCCAAAG 
TATTGAGCAT 
GACAAGGTCT 
AAACCAATAA 
ATGGCCAGAG 
TGACCAAGCC 
TTAGACAAAA 
ATGAAACAGG 
AGGGTGGAGA 
GAAGGATTCA 
TACTCGGTAC 



GAAGGAGAGC 
ACCAGAAGAC 
GGTGAGAAAA 
GG CTGGCTGG 
GGGGAGCTGG 
GGCAGAGGGC 
GAGGAGTGAA 
GGTGTTGGCA 
AGACTTTGCA 
GGGCGCTGCT 
CCCCCACAGG 
TGACCCTCAC 
TTAGCACCAT 
GTCTGGTGGC 
GAAAGAGGAA 
CGCACAAACA 
GGACACATAC 
GCATGCACAC 
TAAATAAAAG 
GCCCATAGAG 
CTGCTCTGTG 
CAGCCGCACA 
GAGAAGTTAA 
AGAGCTTTAG 
TCAAACAGTG 
AGAACAGCAA 
CTGGTGTGCC 
AGCTGGTGAG 
GCATGTCTTT 
GTTGAGAAGG 



TAGAGACTTG 
AGTGAGACCA 
GTGTTTCCAG 
GGGGCCTGAG 
GCTTGATGCA 
ACCCAGGCGC 
CATCCCTGGC 
CAACGTGAGA 
GAGGGAGAGT 
TTTTCTCCAC 
GCACCTTCTG 
CACCACCCCA 
TTCATAGATG 
CTCCCCTCCT 
GCTGTACCTG 
CACAATGCAC 
CTGCACACAC 
ACATGCACAC 
GTTCCCATCT 
GCTTCATTCA 
CAGCATCTGT 
CAGCTTATGC 
GTAAAAAGCA 
TTCAGGTGGT 
GCAGGGATCC 
GTCCAAAGGC 
CAGAGCAGGG 
GGTGGCATCC 
CCCGAATAGG 
TCCTGCTTAT 



11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

1X820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 
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13140 
13200 
13260 
13320 
13380 
13440 



-75- 

TTCCTTCCCT GAAGGAGGAA GAGCTGCTGA TGACAGAGAT TGGCAGTGGC CAAAGACATA 
GAGAGAAGAG GGCAGAACAT GGGCTATTTT AAACACAGAG AAGATTAGCG GGACCCGCTG 
GCAGACCGGA CGTGAAATGT GGAAGGAGCG GGGGCAGCGA GGTCGGCTCC TAGTTTCCTG 
AGAATGTGGG TGAATCACGG GCTCACAGGC AGAGGGAGCA CTAGGATATC AAGGGTTCCC 
TTGTGAACGC CTCAAGTTGG AGATGCCTGA GACATCCAAG TGAGATGTCA AGCAGGCAGC 
TGGAAATAGG AGATGAGCTC TGGGAAAATG CTCCCATCAC CCTGGCCTGT GTGCTGCCTG 
GGCGCACCCA TTCAGGGCCC TCCACGCAGC CCACGCCCCT GCCTCCTGAT TCCTTCTAGG 13500 
CTTCTCCAGC ACTCGTGGGA TGCCCAGATG TGATCAGGGA AGGGCTTGAG GATGCAGGGA 13560 
AGCTGTGGCT GAGAGCCCTA AACACACACA TSCACACGCA CACACACATA CACAGGCACA 
TGCACACACG ACCATACACA CACACAAATG CACGCAGATG CACACAAATG CATATGCACG 
CACACAAATG CATATGCACA CACACACATG CACACATATG CATACACGTA TCCCTTTCAG 
TGGCTTTCCT TTCTGTCCTT AACCCTTGGC CCCTTACAGT GAGCTCCCAG TTCTCCCCAG 
CCTTAGAACC AAACCCTGGG GCTGGCCTGG GAGCCCCCAG TGACCCTCTG TGTCTCTGTA 
GGTGGATGCA CCCTTGGTCC TGGTGCCAGC TGCCACTGCA GGCTGAAGGC CTGTGAGTGT 
GACAAGCAAT CCGTGCACTG CTTCAAAGAG AGCCTGCCCA CCTATGAGAA AAACTTCAAG 13980 
CAGTTCTCCA GCCGGCCCAG GTGTGGCAGA CATAAGCCCT GGTGCTAGGG ACACCACAGG 
GTCCCTCTCA TCATCCAGCA TCCGCTCTAG TGTTGCTCTT CCAGGAAGCC TTCTCAGATC 
ATCCCCAACA GGCCCCTGTT CTTCCACTGG GAGCGAGGAC AAAATGTCTC CCGCAGGGCA 
GCTCACCCTT CAGCATTCTG ACCAAGGGGA CTCCCTGTCG TTCAGCATCA GAGGGCTGGA 
GAGCAGAAAT GGGAAAGAT& AGATGCCTGC CCTGCAGGAG CTGGCATTCT GTGGAGTGGG 
GAGGACTACA AATGCATGGA TATAGAAGTA AGAGACACAT TAGACTGTAG TAAGTGCTAT 14340 
GATGCAGTAA AACAAAGGGA CGGGATAGAG ATGCACCCAA CCCCACATCC CAGGGGTTTC 14400 
CAGGAGGGGA GAAGCCCCAG GATCTACCCC AAACTCTCTC TTCACCCCCA CTGCAAACCG 14460 
GGACACAGAG CAGACTTGAG CGCCAGGCCC ATGCCCAGCT CTAGCTGGCA ACAAAGCCAC 14520 
CACTTTCCTT GCCCCTCTGC GTCCTCAGTT TTTATGATGT CATTCTTAGC TTTTCTTATC 14S80 
AAGAGGCAGA ATCTGTTTTC CCCATCCCAT GAATCTGAAC TGGTCTTGTG GCTTAGTTTG 14640 
GTCAATAGAA TGTTGTGGGA GGGATGGTTT ACCAGTTTTG AGCTAGGCCT CAGGAGGTCT 14700 
AGGGCATGTC TACTCTCTCT TAGGACAGCT GCCCCCACCC TGCAAAAAAG CCTGGGCTAG 14760 
CCTGCTGGAG GATGAGAGCC CACCTGGATC AGTTGTCTCA GCTGATTTCA GACACGTGAG 14820 
AGAGAGCTCA GCGAGACTCA GCTTGTAGCT GACTACAGAT GTGTGAGGGA ACCTGGCTGA 14880 



13620 
13680 
13740 
13800 
13860 
13920 



14040 
14100 

'•'14160 
14220 
14280 
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GACCAAAACA ACTGTCCAGC TGAGCCCAGG CTAAACTGCC 
AAATAAAGGC TGCTGTTCTA AGTCACTGGG TTTTGGTATG 
TAACAGGTGT AATTGGTCCT TATTCCCTTA TTCACTGAGA 
AGCTGGACTT GGAGGCCATG GAAATGCAGT GGACATGGCC 
TGTGGAAGGA GGTCAAGTTC ATGGAATAAT GGAGAACACA 

TTCAGGGAAC ACACATTTAT TGAGCACTTG CTATGTGCCA 

GGATCCAGAT ATTTAAAGAA AACAAACAAA AATCAGGTCC 

TGAGAGTGGT ATCAGCTTTT ACGAATTC 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS- 

Itl iS^S™- a^ino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



AACATGCAGA 

GTTTGTTAGG 

GTGATGGGTT 

TTTGTTCCTT 

CAGCTGTAAT 

GGCACAGTGC 

AAAACTCCTG 



ATTGTGAGCT 

CAGCCATAAC 

CTCAGCCCTG 

ACCTTGAAGC 

CGTTTGCTTG 

CAGGCAGTAG 

GGGAGAATGC 



14940 

15000 

15060 

15120 

15180 

15240 

15300 

15328 



(Xi) SEQUQ^CE DESCRIPTION: SEQ ID NO:34: 



Met Lys Leu Leu Leu Leu Ala Ala Leu Leu Thr Ala Gly Val TJr Ala 
His ser lie Ser Thr Arg Ala Val Tp Gin Phe Arg Asn Met lie Lys 
cys Tnr lie Pro Gly Ser Asp Pro Leu Arg Glu Tyr Asn ^r Gly 

cys Tvr cy. 3„ ^„ 2, 

Arg cys Cys Gin T^r His Asp His Cys Tyr Asn Gin Ala Lys Lys Leu 

80 

«» ser cy. Ly, PH, xi. ».p ^„ p„ ^ ^ ^„ Thr ^ s„ 

^r .y, CVS s.r Cly «„ v.l .1. Thr Cy. s,r «p I.ys 1 

105 110 
cy, «u s.r Phe II. cy, Ae„ cys Asp «„ ai, ji. ^, p,. 

s« Lys val Pro Tyr A.„ Lys olu Tyr Ly. Asp L,u Asp Thr Lys Lys 

His Cys 
145 
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(2) IKFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



Met Lys Val Leu Leu Leu Leu Ala Val Val He Met Ala Phe Gly Scr 
15 10 x5 

He Gin Val Gin Gly Ser Leu Leu Glu Phe Gly Gin Met He Leu Phe 
20 25 30 

Lys Thr Gly Lys Arg Ala Asp Val Ser Tyr Gly Phe Tyr Gly cys His 
•^5 40 45 

cys Gly Val Gly Gly Arg Gly Ser Pro Lys Asp Ala Thr Asp Trp Cys 
50 55 gQ ^ 

cys Val Thr His Asp Cys Cys Tyr Asn Arg Leu Glu Lys Arg Gly Cvs 
?^ 70 75 80 

Gly Thr Lys Phe Val Thr Tyr Lys Phe Ser Tyr Arg Gly Gly Gin He 
S5 90 95 

Ser Cys Ser Thr Asn Gin Asp Ser Cys Arg Lys Gin Leu cys Gin Cys 
100 105 lie ^ 

Asp Lys Ala Ala Ala Glu Cys Phe Ala Arg Asn Lys Lys Ser Tyr Ser 
115 120 125 

Leu Lys Tyr Gin Phe Tyr Pro Asn Lys Phe Cys Lys Gly Lys Thr Pro 



140 



Ser Cys 
145 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Lys Leu Leu Val Leu Ala Val Leu Leu Thr Val Ala Ala Ala Asp 
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1 5 10 15 

Ser Gly lie Ser Pro Arg Ala Val Trp Gin Phe Arg Lys Met He Lys 
20 25 30 

Cys Val He Pro Gly Ser Asp Pro Phe Leu Glu Tyr Asn Asn Tyr Gly 
35 40 45 

cys Tyr Cys Gly Leu Gly Gly Ser Gly Thr Pro Val Asp Glu Leu Asp 
50 55 60 

Lys cys Cys Gin Thr His Asp Asn Cys Tyr Asp Gin Ala Lys Lys Leu 
^5 70 75 80 

Asp Ser Cys Lys Phe Leu Leu Asp Asn Pro Tyr Thr His Thr Tyr Ser 
85 90 95 

Tyr Ser Cys Ser Gly Ser Ala He Thr Cys Ser Ser Lys Asn Lys Glu 
iOO 105 110 

cys Glu Ala Phe He Cys Asn Cys Asp Arg Asn Ala Ala He Cvs Phe 
115 120 125 

Ser Lys Ala Pro Tyr Asn Lys Ala His Lys Asn Leu Asp Thr Lys Lys 
130 135 140 

Tyr Cys Gin Ser 
145 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Lys Thr Leu Leu Leu Leu Ala Val He Met He Phe Gly Leu Leu 
1 5 10 



15 



Gin Ala His Gly Asn Leu Val Asn Phe His Arg Met He Lys Leu Thr 
20 25 30 

Thr Gly Lys Glu Ala Ala Leu Ser Tyr Gly Phe Tyr Gly Cys His Cys 
35 40 45 

Gly Val Gly Gly Arg Gly Ser Pro Lys Asp Ala Thr Asp Arg Cys Cys 
50 55 60 

Val Thr His Asp Cys Cys Tyr Lys Arg Leu Glu Lys Arg Gly Cys Gly 
^5 70 75 80 

Thr Lys Phe Leu Ser Tyr Lys Phe Ser Asn Ser Gly Ser Arg He Thr 
85 90 95 
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Cys Ala Lys Gin Asp Ser Cys Arg Ser Gin Leu Cys Glu Cys Asp Lys 
100 105 110 

Ala Ala Ala Thr Cys Phe Ala Arg Asn Lys Thr Thr Tyr Asn Lys Lys 
lis 120 125 

Tyr Gin Tyr Tyr Ser Asn Lys His Cys Arg Gly Ser Thr Pro Arg Cvs 
130 135 ^ ^ 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ala Vai Trp Gin Phe Arg Lys Met He Lys Cys Val He Pro Gly Ser 
15 10 

Asp Pro Phe Leu Glu Tyr Asn Asn Tyr Gly Cys Tyr Cys Gly Leu Gly 
20 25 30 

Gly Ser Gly Thr Pro Val Asp Glu Leu Asp Lys Cys Cys Gin Thr His 
35 40 45 

Asp Asn cys Tyr Asp Gin Ala Lys Lys Leu Asp Ser Cys Lys Phe Leu 
^° 55 60 

Leu Asp Asn Pro Tyr Thr His Thr Tyr Ser Tyr Ser Cys Ser Gly Ser 

70 75 80 

Ala He Thr Cys Ser Ser Lys Asn Lys Glu Cys Glu Ala Phe He Cvs 
85 90 95 

Asn cys Asp Arg Asn Ala Ala He Cys Phe Ser Lys Ala Pro Tyr Asn 
100 105 110 

Lys Ala His Lys Asn Leu Asp Thr Lys Lys Tyr Cys Gin Ser 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECXmE TYPE: protein 
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(xi) SEQUEKCE DESCRIPTION: SEQ ID NO:39: 

Asn Leu Val Asn Phe His Arg Met He Lys Leu Thr Thr Gly Lys Glu 

10 15 

Ala Ala Leu Ser Tyr Gly Phe Tyr Gly cys His Cys Gly Val Gly Gly 



30 



Arg Gly Ser Pro Lys Asp Ala Thr Asp Arg Cys cys Val Thr His Asp 
cys cys Tyr Lys Arg Leu Glu Lys Arg Gly Cys Gly Thr Lys Phe Leu 
ser Tyr Lys Phe Ser Asn Ser Gly Ser Arg He Thr Cys Ala Lys Gin 
ASP ser cys Arg Ser Gin Leu Cys Glu Cys Asp Lys Ala Ala Ala Thr 



90 



95 



cys Phe Ala Arg Asn Lys Thr Thr Tyr Asn Lys Lys Tyr Gin Tyr Tyr 



Ser Asn Lys His Cys Arg Gly Ser Thr Pro Arg Cys 
(2) INFORMATION FOR SEQ ID NO: 40: 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Gly Leu Leu Asp Leu Lys Ser Met He Glu Lys Val Thr Gly Lys Asn 

Ala Leu Thr Asn Tyr Gly Phe Tyr Gly cys Tyr Cys Gly Trp Gly Gly 

25 30 

Arg Gly Thr Pro Lys Asp Gly Thr Asp Trp Cys Cys Trp Ala His Asp 

40 45 

His cys Tyr Gly Arg Leu Glu Glu Lys Gly cys Asn He Arg Thr Gin 

60 

ser Tyr Lys Tyr Arg Phe Ala Trp Gly Val Val Thr Cys Glu Pro Gly 

"75 80 

pro Phe cys His Val Asn Leu Cys Ala Cys Asp Arg Lys Leu Val Tyr 

90 95 

cys Leu Lys Arg Asn Leu Arg Ser Tyr Asn Pro Gin Tyr Gin Tyr Phe 
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100 105 110 



Pro Asn lie Leu Cys Ser 
115 

(2) INFORMATION FOR 5£Q ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 aaino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Ala Val Trp Gin Phe Arg Asn Met lie Lys Cys Thr lie Pro Gly Ser 
15 10 15 

Asp Pro Leu Arg Glu Tyr Asn Asn Tyr Gly Cys Tyr Cys Gly Leu Gly 
20 25 30 

Gly Ser Gly Thr Pro Val Asp Asp Leu Asp Arg Cys Cys Gin Thr His 
35 40 45 

Asp His Cys Tyr Asn Gin Ala Lys Lys Leu Glu Ser Cys Lys Phe Leu 
50 55 60 

lie Asp Asn Pro Tyr Thr Asn Thr Tyr Ser Tyr Lys Cys Ser Gly Asn 
65 70 75 80 

Val He Thr Cys Ser Asp Lys Asn Asn Asp Cys Glu Ser Phe He Cys 
85 90 95 

Asn Cys Asp Arg Gin Ala Ala He Cys Phe Ser Lys Val Pro Tyr Asn 
100 105 110 

Lys Glu Tyr Lys Asp Leu Asp Thr Lys Lys His Cys 
115 120 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Ser Leu Leu Glu Phe Gly Gin Met He Leu Phe Lys Thr Gly Lys Arg 
15 10 15 
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Ala ASP val Ser Tyr Gly Phe Tyr Gly cys His Cys Gly Val Gly Gly 

25 30 

Arg Gly Ser Pro Lys Asp Ala Thr Asp Trp cys Cys val Thr His Asp 

cys cys Tyr Asn Arg Leu Glu Lys Arg Gly cys Gly Thr Lys Phe Val 

Thr Tyr Lys Phe Ser Tyr Arg Gly Gly Gin He Ser cys Ser Thr Asn 

80 

cm ASP ser cys Arg Lys Gin Leu Cys Gin Cys Asp Lys Ala Ala Ala 
Clu cys Phe Ala Arg Asn Lys Lys Ser Tyr Ser Leu Lys Tyr Hn Phe 

Tyr Pro Asn Lys Phe Cys Lys Gly Lys Thr Pro Ser Cys 

"° 125 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xz) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ser Phe Trp Gin Phe Gin Arg Met Val Lys His He Thr Gly Arg Ser 
Ala Phe Phe Ser Tyr Tyr Gly Tyr Gly cys Tyr cys Gly Leu Gly Gly 
Arg Gly lie Pro Val Asp Ala Thr Asp Arg cys Cys Tp H His Asp 
cys cys Tyr His Lys Leu Lys Glu Tyr Gly cys Gin Pro He Leu Asn 
Ala Tyr Gin Phe Ala lie Val Asn Gly Thr Val Thr Cys Gly cys Thr 
Met Gly Gly Gly Cys Leu Cys Gly Gin Lys Ala Cys Glu Cys Asp Lys 

Leu ser Val Tyr Cys Phe Lys Glu Asn Leu Ala Thr Tyr Glu ^ys Thr 

J-os 1X0 

Phe Lys Gin Leu Phe Pro Thr Aro pt-« oi« 

115 Gly Arg Asp Lys Leu 

125 

His Cys 
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130 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRAKDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Gly Leu Leu Glu Leu Lys Ser Met lie Glu Lys Val Thr Gly Lys Asn 
15 10 15 

Ala Val Lys Asn Tyr Gly Phe Tyr Gly Cys Tyr Cys Gly Trp Gly Gly 
20 25 30 

His Gly Ttir Pro Lys Asp Gly Thr Asp Trp cys Cys Arg Met His Asp 
35 40 45 

Arg cys Tyr Gly Leu Leu Glu Glu Lys His Cys Ala He Arg Thr Gin 
50 55 60 

Ser Tyr Asp Tyr Arg Phe Thr Gin Asp Leu Val He Cys Glu His Asp 
65 70 75 80 

Ser Phe Cys Pro Val Arg Leu Cys Ala Cys Asp Arg Lys Leu Val Tyr 
85 90 95 

Cys Leu Arg Arg Asn Leu Trp Ser Tyr Asn Arg Leu Tyr Gin Tyr Tyr 
100 105 110 

Pro Asn Phe Leu Cys 
115 
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The present invention may, of course, be 
carried out in other specific ways than those herein set 
forth without departing from the spirit and essential 
characteristics of the invention. The present 

embodiments are, therefore, to be considered in all 
respects as illustrative and not restrictive and all 
changes coming within the meaning and equivalency range 
of the appended claims are intended to be embraced 
herein. 
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Having described our invention, we claim: 



1.) 



A substantially pure or isolated low molecular 



weight PLA^ enzyme having phospholipase activity, said 
enzyme being free of disulfide bridges between cysteine 
amino acids 11 and 77 and an elapid loop, said enzyme 
having at least seventeen amino acids in its sequence 
which are identical to those amino acids conserved in 
Type II PIiA2 enzymes having phospholipase activity. 

2. ) A PLiAs enzyme of claim 1, said PLAs enzyme 
having only 12 cysteine amino acids residues in its 
mature sequence . 

3. ) A PLA2 enzyme of claim 1, said PLA2 enzyme 
having only 16 cysteine amino acid residues in its 
mature sequence . 

4. ) A PLA2 enzyme of claim 1, said PLA2 enzyme 
having a molecular weight of about 14KD. 

5. ) A PLA2 enzyme of claim 1, said PLA2 enzyme 
having an amino acid sequence set forth in FIG. 3 or an 
equivalent fragment thereto or an active fragment 
thereof . 
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6^) A PLA2 enzyme of claim 1, said PLA2 enzyme 

having an amino acid sequence set forth in FIG. 11 or 
an equivalent fragment thereto or an active fragment 
thereof. 

7. ) A PIiA2 enzyme of claim 1, said PLA2 enzyme 
having an amino acid sequence set forth in FIG. 12 or 
an equivalent fragment thereto or an active fragment 
thereof. 

8. ) A PIiA2 enzyme of claim 1, said PLA2 enzyme 
having an amino acid sequence encoded for by the 
nucleotide sequence of FIG. 19 or an equivalent 
fragment thereto or an active fragment thereof. 

9. ) A PLA2 enzyme of claim 2, said PLA2 enzyme 
having an amino acid sequence which includes the 
following prepeptide amino acid sequence 
MDLLVSSGMKGIAVFLVFIFC. 



10.) A PLA2 enzyme of claim 2, said PLiA2 enzyme 

having an amino acid sequence which includes the 
following propeptide amino acid sequence WTTSTLS. 
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11.) A PLA2 enzyme of claim 2, said PLA2 enzyme 

having the following features : 

a. ) a phenylalanine residue conserved at 
position 5 in the mature sequence; 

b. ) a methionine residue conserved at 
position 8 in the mature sequence; 

c. ) a histidine residue conserved at position 
4 8 and an aspartic acid residue At position 4 9 in the 
mature sequence; 

d. ) a valine residue conserved at position 
9 in the mature sequence; and 

e. ) being free of alanine residues at 
positions 102 and 103 in the mature sequence, 

12 . ) A PL1A2 enzyme of claim 2 , said PLAj enzyme 
having a YGCYCG Ca^* binding loop. 

13. ) A PLA2 enzyme of claim 3, said PLA2 enzyme 
having an amino acid sequence which includes a 
prepeptide amino acid sequence selected from a group 
consisting of MKGLLPLAWFLACS VPAVQG and 
MKRLLTLiAWFLACSVPAVPG . 
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14.) 



A PLA2 enzyme of claim 3 , said PLAj enzyme 



having the following features: 

a. ) an isoleucine residue conserved at 
position 9 in the mature sequence; 

b. ) a methionine residue conserved at 
position 8 in the mature sequence; 

c. ) a histidine residue conserved at position 
48 and an aspartic acid residue conserved at position 

4 9 in the mature sequence; 

d. ) a leucine residue conserved at position 

5 in the mature secjuence; and 

e. ) being free of alanine residues at 
positions 102 and 103 in the mature sequence. 

15. ) A PliAj enzyme of claim 3, said PLA2 enzyme 
having a YGCYCG Ca^"^ binding loop . 

16 . ) A PIjAj enzyme of claim 1, said PLAs enzyme 
being a Type III PLAj enzyme or an equivalent fragment 
thereto or an active fragment thereof, 

17 . ) A PLA2 enzyme of claim 1 , said PIxAj enzyme 
being a TYPE IV PLAj enzyme or an equivalent fragment 
thereto or an active fragment thereof. 
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18 •) A PLA2 enzyme of claim 1, said PLA2 enzyme 

further including a COOH-terminal amino acid 
ex-tension. 

19. ) A PIiA2 enzyme of claim 18, said PLA2 enzyme 
being a Type III PLA2 enzyme or an equivalent 
fragment thereto or an active fragment thereof, said 
Type III PIiA2 enzyme having an about seven amino 
acids COOH-terminal extension. 

20. ) A PIiA2 enzyme of claim 19, said seven amino 
acids COOH-terminal extension having the following 
amino acid sequence GRDKI-HC, said Type III PLA2 
enzyme being a rat Type III PI1A2 enzyme. 

21. ) A PLA2 enzyme of claim 18, said PLA2 enzyme 
being a Type IV PLA2 enzyme or an equivalent fragment 
thereto or an active fragment thereof, said type IV 
PLA2 enzyme having an about one amino acid 
COOH-terminal extension. 

22. ) A PIiA2 enzyme of claim 21, said one amino 
acid COOH-terminal extension having a serine amino 
acid COOH-terminal extension, said Type IV PLA2 
enzyme being a human Type IV PIiA2 enzyme. 
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23. ) A substantially pure or isolated nucleotide 
sequence coding for a polypeptide having 
phospholipase activity, the polypeptide having no 
disulfide bridges between cysteine amino acids 11 and 
77 and no elapid loops, 

24. ) A nucleotide sequence of claim 23, the 
polypeptide having the amino acid sequence set forth 
in FIG. 3 or an equivalent fragment thereto or an 
active fragment thereof. 

25. ) A nucleotide sequence of claim 23, the 
polypeptide having the amino acid sequence set forth 
in FIG . 11 or an equivalent fragment thereto or an 
active fragment thereof. 

26. ) A nucleotide sequence of claim 23, the 
polypeptide having the amino acid sequence set forth 
in FIG. 12 or an equivalent fragment thereto or an 
active fragment thereof. 

27. ) A nucleotide sequence of claim 23, the 
polypeptide having the amino acid sequence encoded by 
the nucleotide sequence set forth in FIG. 19 or an 
equivalent fragment thereto or an active fragment 
thereof , 
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28. ) A nucleotide secjuence of claim 23, the 
polypeptide sequence having: 

a. ) a phenylalanine residue conseirved at 
position 5 in the mature amino acid sequence; 

b. ) a methionine residue conserved at 
position 8 in the mature amino acid sequence; 

c. ) a histidine residue conserved at position 
48 and an aspartic acid residue conserved at position 
4 9 in the mature amino acid sequence; and 

d. ) being free of alanine residues at 
positions 102 and 103 in the mature amino acid sequence. 

29. ) A nucleotide sequence of claim 28, the 
polypeptide sequence having only 16 cysteine residues 
in its mature amino acid sequence. 

30. ) A nucleotide sequence of claim 29, the 
polypeptide sequence including the following prepeptide 
amino acid sequence MDLLVSSGMKGIAVFLVFIFC . 
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31. ) A nucleotide sequence of claim 22, the 
polypeptide sequence having: 

a. ) an isoleucine residue conserved at 
position 9 in the mature amino acid sequence; 

b. ) a methionine residue conserved at 
position 8 in the mature amino acid sequence; 

c. ) a histidine residue conserved at position 
4 8 and an aspartic acid residue conserved at position 
4 9 in the amino acid sequence; 

d. ) 12 cysteine residues in the mature amino 
acid sequence ; and 

e. ) being free of alanine residues at 
position 102 and 103 in the mature amino acid sequence. 

32. ) A nucleotide sequence of claim 29, the 
polypeptide sequence including the following propeptide 
amino acid sequence WTTSTLS . 

33. ) A nucleotide sequence of claim 31, the 
polypeptide sequence including a prepeptide amino acid 
seq[uence selected from a group consisting of 
MKGLLPLiAWFIiACSVPAVQG and MKRLLTLANFLACSVPAVPG . 
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34. ) A nucleotide sequence of claim 22, the 
polypeptide being a Type III PIiA2 enzyme or an 
equivalent fragment thereto or an active fragment 
thereof. 

35. ) A nucleotide sequence of claim 23, the 
polypeptide being a Type IV PIA2 enzyme or an 
equivalent fragment thereto or an active fragment 
thereof. 

3^-) A nucleotide sequence of claim 22, the 

polypeptide further including a COOH-terminal amino 
acid extension. 

^'7.) A nucleotide sequence of claim 36, the 

polypeptide being a Type III PLA2 enzyme or an 
equivalent fragment thereto or active fragment 
thereof, said Type III PLA2 enzyme having an about 
seven amino acids COOH-terminal extension. 

38.) A nucleotide sequence of claim 37, said 

seven amino acids COOH-terminal extension having the 
following amino acid sequence GRDKLHC, said Type III 
PLA2 enzyme being a rat Type III PLA2 enzyme. 
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39. ) A nucleotide sequence of claim 36, the 
polypeptide being a Type IV PIA2 enzyme or an 
equivalent fragment thereto or active fragment 
thereof, said Type IV PLA2 enzyme having an about one 
amino acid COOH-terminal extension. 

40, ) A nucleotide sequence of claim 39, said one 
amino acid COOH-terminal extension being a serine 
amino acid COOH-terminal extension, said Type IV PLA2 
enzyme being a human Type IV PI1A2 enzyme. 
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41. ) A recombinant DNA expression vector 
comprising: 

a first DNA segment having a nucleotide 
sequence containing bases whose translated region 
^ codes for a PIiA2 enzyme selected from a group 
consisting of Type III and Type IV or an equivalent 
fragment thereto or an active fragment thereof; and 

a second DNA segment heterologous to said 
first DNA segment wherein said first DNA segment is 
10 operably linked to said second DNA segment* 

42. ) A recombinant DNA expression vector of 
claim 41, said first DNA segment having the 
nucleotide sequence set forth in FIG, 3 or an 
equivalent fragment thereto or an active fragment 
thereof • 

4 3.) A recombinant DNA expression vector of 

claim 41, said first DNA segment having the 

nucleotide sequence set forth in FIG. 11 or an 

equivalent fragment thereto or an active fragment 
thereof. 
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44..) A recombinant DNA expression vector of 

claim 41, said first DNA segment having the 
nucleotide secjuence set forth in FIG. 12 or an 
equivalent fragment thereto or an active fragment 
thereof. 

45. ) A recombinant DNA expression vector of 
claim 41, said first DNA segment having the 
nucleotide sequence set forth in FIG. 19 or an 
equivalent fragment thereto or an active fragment 
thereof. 

46. ) A recombinant expression vector of claim 
41, said vector being pCHlO. 

47. ) A recoinbinant expression vector of claim 
41, said vector being pR8-3'. 

48. ) A host transfected with said recombinant 
expression vector of claim 41. 

49 •) A host of claim 48, said host being a cell 

line. 

50.) A host of claim 49, said cell line being a 

cell line designated as CpCHlO-lD. 
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51. ) A host of claim AS, said cell line being a 
cell line selected from a gr up consisting of 
CpCHlO-lB, CpCHlO-lC and CpCH10-2G. 

52. ) A host of claim 49, said cell line being a 
cell line designated as CpR8-3'. 
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53 . ) A cDNA encoding a phospholipase enzyme 
having phospholipase activity, said phospholipase 
enzyme being selected from a group consisting of Type 
III and Type IV, including equivalent fragments 
thereto and active fragments thereof. 

54. ) A cDNA of claim 53, said phospholipase 
enzyme being RPL1A2-8 or an equivalent fragment 
thereto or an active fragment thereof. 

55. ) A cDNA of claim 53, said phospholipase 
enzyme being HPIiA2-10 or an equivalent fragment 
thereto or an active fragment thereof. 

56. ) A cDNA of claim 53, said phospholipase 
enzyme being RPIiA2-10 or an equivalent fragment 
thereto or an active fragment thereof - 



BNSDOCID: <WO 9502328Al_L> 



SUBSTITUTE SHEET (RULE 26) 



wo 95/02328 PCT/US94/07926 

-99" 

57. ) A method of producing a PIA2 enzyme 
selected from a group consisting of Type III and Type 
IV or an equivalent fragment thereto or an active 
fragment thereof, said method comprising: 

^ aO inserting a recombinant expression 

vector into a host by transf ection, said recombinant 
expression vector having a nucleotide sequence 
containing bases whose translated region codes for 
the Type III or Type IV PLA2 enzyme or an equivalent 
•LO fragment thereto or an active fragment thereof; 

b. ) cultivating the transf ected host; and 

c. ) expressing the Type III or Type IV 
PLA2 enzyme or an equivalent fragment thereto or an 
active fragment thereof by the transf ected host. 

58. ) A method of claim 57, said cultivating step 
comprises growing the host in a cell culture medium* 



59. ) A method of claim 57, said cultivating step 
comprises introducing the host into an animal. 

60. ) A method of claim 57, the host being an 
eukaryotic cell. 

61. ) A method of claim 57, the host being a 
prokaryotic cell. 
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62. ) A method of expressing a Type III or Type 
IV PLA2 enzyme or an equivalent fragment thereto or 
an active fragment thereof in an animal comprising: 

introducing a nucleotide sequence 

containing bases whose translated region codes for 
the Type III or Type IV PIiA2 enzyme or an equivalent 
fragment thereto or an active fragment thereof; and 

expressing the nucleotide sequence in the 

animal • 

63. ) A method of claim 62, said nucleotide 
sequence including the nucleotide sequence set forth 
in FIG. 3 or an equivalent fragment thereto or an 
active fragment thereof. 

64 . ) A method of claim 62 , said nucleotide 
sequence including the nucleotide sequence set forth 
in FIG. 11 or an equivalent fragment thereto or an 
active fragment thereof. 

65. ) A method of claim 62, said nucleotide 
sequence including the nucleotide sequence set forth 
in FIG . 12 or an equivalent fragment thereto or an 
active fragment thereof. 
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66. ) A metzhod of claim 62, said nucleotide 
sequence including the nucleotide sequence set forth 
in FIG. 19 or an equivalent fragment thereto or an 
active fragment thereof, 

67 . ) A method of claim 62 , said introduction 
step comprises introducing a recombinant expression 
vector into the animal, the recombinant expression 
vector having the nucleotide sequence. 

68. ) A method of claim 62, said introduction 
step comprises introducing the nucleotide sequence 
into the genome of an animal. 
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69^) A substantially pure or isolated antisense 

nucleotide sequence which has the ability to inhibit 
or interfere with expression of a gene or mRNA 
transcript encoding for a Type III PLA2 enzyme or an 
amino acid sequence which is an equivalent thereto or 
an active fragment thereof. 
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70.) A substantially pure or isolated antisense 

nucleotide sequence which has the ability to inhibit 
or interfere with expression of a gene or loRNA 
transcript encoding for a Type IV PIiA2 enzyme or an 
amino acid sequence which is an equivalent thereto or 
an active fragment thereof. 
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71. ) A substantially pure or isolated Type III 
PLA2 enzyme, or an equivalent fragment thereto or an 
active fragment thereof, said PIiA2 enzyme having 
phospholipase activity which is significant at a pH 
of between about 7 and about 9 and at a calcium 
concentration of between about 0.3 mM and about 2 mM. 

72. ) A Type III PLA2 enzyme of claim 71, said 
phospholipase activity progressively declining at a 
pH which is greater than about 9 and at a calcium 
concentration which is greater than about 2 mM. 
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73. ) A substantially pure or isolated Type IV 
PIiA2 enzyme, or an equivalent fragment thereto or an 
active fragment thereof, said PIiA2 enzyme having 
phospholipase activity which is significant at a pH 
of between about 6-5 and about 7,5 and at a calcium 
concentration of between about 7 mM and about 100 mM» 

74. ) A Type IV PLA2 enzyme of claim 73, said 
phospholipase activity progressively declining at a 
calcium concentration of greater than about 100 mM, 
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75. ) A substantially pure or isolated nucleotide 
sequence having an internal ribosome binding site 
which allows for internal initiation of 
cap-* independent mRNA translation, said nucleotid 
sequence including bases 116-720 designated in FIG. 3 
or an equivalent fragment thereto or an active 
fragment thereof. 

76. ) A nucleotide sequence of claim 75, said 
nucleotide sequence being operably linked to a second 
nucleotide sequence heterologous to said nucleotide 
sequence . 

77. ) A nucleotide sequence of claim 76, said 
second nucleotide sequence containing bases whose 
translated region encodes for lucif erase. 

78. ) A nucleotide sequence of claim 76, said 
second nucleotide seqpjence containing bases whose 
translated region encodes for a Type III PLA2 enzyme 
or an equivalent fragment thereto or an active 
fragment thereof. 

79. ) A nucleotide sequence of claim 78, said 
second nucleotide sequence includes the nucleotide 
sequence set forth in FIG. 3 or an equivalent 
fragment thereto or an active fragment thereof. 
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80. ) A nucleotide sequence of claim 78, said 
second nucleotide sequence includes the nucleotide 
sequence set forth in FIG. 19 or an equivalent 
fragment thereto or an active fragment thereof. 

81. ) A nucleotide sequence of claim 76, said 
second nucleotide sequence containing bases whose 
translated region encodes for a Type IV PL1A2 enzyme 
or an equivalent fragment thereto or an active 
fragment thereof. 

82. ) A nucleotide sequence of claim 81, said 
second nucleotide sec[uence includes the nucleotide 
sequence set forth in FIG. 11 or an equivalent 
fragment thereto or an active fragment thereof. 

83. ) A nucleotide sequence of claim 81, said 
second nucleotide sequence includes the nucleotide 
sequence set forth in FIG. 12 or an equivalent 
fragment thereto or an active fragment thereof, 

84. ) A nucleotide sequence of claim 86, a 
recombinant expression vector including said 
nucleotide sequence operably linked to said second 
nucleotide sequence heterologous to said nucleotide 
sequence . 
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Pig. 1 



RPLA2-8 eONA Structure 



200bp 
ATG TGA 

W 



8 128 722 1198 W1478 



4385 
1358 

Represents 121 bp repeat sequence ■ Represents RPLA2-8 open reading frame. 
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rig. 2 



RPLA2-8 cDNA Secondary Structure 




200bp 

4385 
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Fig. 3 RFLX2-*S cDHA and Derived J^iso Acid Se^uenee 

(1/5) 

10 20 30 40 50 60 

GAATTCCG CCTCCACCTCrCAAATGCTGGGATTGCAGGATGTCCCCCCACCCCTGCTCCC 
clone linker 

70 80 90 100 110 120 

TTSTCTggrTG gTTgCTGOTGCCGGAATGTATrj^ gTTAAW 

Pla8-8 (pri&er) 
130 140 150 160 170 180 

ATTCCAGCATAGAAGGGCCGGCCAGCGCGTTGGAGGACAGGCCTCTATTATTTCCGCGGT 

190 200 210 220 230 240 

Pla8-'7 (priaer) 
250 260 270 280 290 300 

AAGGCTGGAACAGCAATGCAGACCTAGGTAAAGGGCACAGAGCTGAGGGAAGCTCCTGGG 

310 320 330 340 350 360 

AGCCTCCCTGCAGCTCCTGCCTCTGCACATGACCCGGACTCCTTTTCTCTCTTTCGATCT 

370 3^80 390 400 410 420 

GCGTCCAGGGACTGGCTTGTACACACCCCTCCCAGGAGACCCCTTGGCAGCTGCACACTC 

430 440 450 460 470 480 

AGGCTCCATCCAAGTTCGCTCTGCCCCTGGGGAAGGCTGCTCAAAAGCCCTGGCTCCCAG 

490 500 510 520 530 540 

TTTCTGGGGACCCACAGAGAGCCTCTCACCTCGCAGCTCAGCTCCATCCGCCTCCTGTGC • ■ 

550 560 570 580 590 600 

CrGGCTGCGACCAGCTGGGTCTAACTATAGACAGTCAGCAACTTCAGCCACTTCACCGAC 

610 620 630 640 650 660 

TTTCCCAACAGCTTTGAGATTTGGAAGCCGGAAGCCTGACTGCCrTCTCAGAAGCTACGG. 

670 680 690 700 710 720 

TCCACTACCrrCAGCCATTCTGTTGGAGCTGAACTGGCAGATGAAGGTGAGACCCA GGCAC 

730 740 750 760 770 780 

CATGGACCTCCTGGTCrrCCrrCAGGAATGAAGSSgATgGrTr:Tr-T^rr^f7Tr^^ 

MetAspLeuLeuVaiSerSerGlyMetLysGlylleAlaValPheLeuValPhellePh 
Rclo8-5* (priaer) 

790 800 810 820 830 840 

CTGCTGGACAACCTCCACCCTCAGCAG CTTCTGGCAGTTCCAGAGGATGC TgAAAgArii^ 

eCysTrpThrThrSerThrLeuSerSerPheTrpGlnPheGlnArgMetiValLysHisIl 

Pla8-l (primer) 
850 8 60 870 880 890 900 

CACGGGGCGCAGCGCCTTCrrCTCCTATTACGGATATGGCTGCTACTGTGGGCTTGGGGG 
eThrGlyArgSerAlaPhePheSerTyrTyrGlyTyrGlyCysTyrCysGlyLeuGlyGl 

910 920 930 940 950 960 

CCGAGGGArCCCTGTGGACGCCACAGACXGGTGCrrGtrrGGr.r-rri.'rn.^r^r-.y^r^^^ 

yArgGlylleProValAspAlaThrAspArgCysCysTrpAlaHisAspCysCysTyrHi 

RPLA2-8 cDNA sequence corresponds to SEQ ID NO: 21; and Derived 
Amino Acid sequence corresponds to SEQ ID NO: 22:. 
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PIG. 3 (2/5) 

Pla8-2 (primer) 

970 980 990 1000 1010 1020 

CKkZZ. . AAGGAATATGGCTGCCAGCCCATCTTGAATGCCTATCASTrTGCCArTGTCAA 
sLysLeuLysGluTyrClyCysGlnProIleLeuAsnAlaTyrGlnPheAlalleValAs 

1030 1040 1050 1060 1070 1080 

CGGGACCGTGACCTCTGGATGCACCATGCGTGGCGGCTGCTTGTGCGGGCAGAAAGCCTG 
nGlyTtirValThrCysGlyCysThrMetGlyClyGlycysLeuCysGlycinLysAlaCy 

1090 1100 1110 1120 1130 1140 

7GAG7GTGACAAACTGTCTGTGTACTGCTTCAAGGAGAACCTGGCCACCTACGAGAAAAC 
sGluCysAspLysLeuSerValTyrCysPheLysGluAsnLeuAlaThrTyrGluLysTh 

1150 1160 1170 1180 1190 1200 

TTTCAAGCAGCTCTTCCCCACCAGCCCCCACTGTGGC AGCSAgAAAgTrrATTGCTAGGC 
rPheLysGlnLeuPheProThrArgProGlnCysGlyArgAspLysLeuHisCysEnd 

Rclo8-3« (primer) 

1210 1220 1230 1240 1250 1260 

1270 1280 1290 1300 1310 1320 

CTTCCCCTCCAAGAGTCCCCAGCCTCCTGCAGCTCAGCCTTGCTCTCTAGGGAGTGTCTT 

1330 1340 1350 1360 1370 1380 

CTCAGGCATTAGGGGACCGGAGGTGGAGAATTCCTGCCCTGGAATCAGACCATGGGTACC 

1390 1400 . 1410 1420 1430 1440 

TGGCAATTAAGTGATACATTCCGGCAGCAGGAAGCAACGACACAAGGGAGCAGGGGTCGC 

1450 1460 1470 1480 1490 1500 

GGGACATCCTGCAATCCCAGCATTTGAGAGGTGGAGGCAAGAGGTGGGGGGTAGCCTCCA 

1510 1520 1530 1540 1550 1560 

CT\TACGGTAAGTTCAAGGCTAACCTGAGCTACCTGAGACCrrGCCTTGAAAAAACTTTT 

1570 1580 1590 1600 1610 1620 

TTAAAAAACGTTTAAAGGAAAAGAAAACAGAAAGACACGGGGACTGGGCTGAAAGGTACT 

1630 1640 1650 1660 1670 1680 

CTCAAACCGATTrCCCAGGAAGAGCGCACAGCCCCAGGTTCAGCTCCAGCCTGAACTCCC 

I'^SO 1730 1740 

CCATACCCTCAGTCCTGGTCAGGATGTGTGTCTGACTGGGGAACCAAGTCCTCCACCCCG 

1750 1760 1770 1780 1790 1800 

GTCGAGCTTAGCTGGGAACTACGCAGGTGTCCTAGAAAATACAGTCCTAAGAGCCTCACC 

1810 1820 1830 1840 1850 I860 

CGGAGTCTCATCCCCATTTGCTCCAGGACTGACCTCTGTAAATCTTCCAGCAGGAAGCAG 

1870 1880 1890 1900 1910 1920 

GCTGTACCCATCTCAGGAGGTGCGGTGCTGTTAGAACAATGGTGTGCACCAGTGACACAA 



1930 1940 1950 1960 1970 



1980 
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FIG. 3 (3/5) 

AGATGTCATCCTTAAGATGGCATCAAGAASTGGAAAGGACATTC3GAACAGTGGGTCCAA 
1990 2000 2010 2020 2030 30'n 

GCCACCCAAJVGTCCTCACCCCAArrTAGAAGCCGTTGGTCCTGTAJ.GACTTAAATCTACT 
20S0 2060 2070 2080 2090 ->inn 

AAACAACGAAGGTCTAACTGCGCTGGAATCTGAACTTCATGGTGCCAGl?rGGGGCaG?S 
2X10 2120 2130 2140 sictn -ii^n 

GGTGGGGACCTGGCCGTGGCCATCACCATGATTGCCTCTCTGCATGaTGACACrTG?m 

2170 2180 2190 2200 2230 

TTGCACCCTAGCTCTCAGCACATCTGAAAAGGACAGACTCTCCTGTTCA?TCC:rrGi^?? 

2230 2240 2250 2260 - 

TGAGACTCTCCrrCACTAATGTACCAAAAATGGAGGT"fSAGTGCAG§^CAGCC^§?S 

2290 2300 2310 235o 

AG=TCCACSaCA5GAa0M5CT=G05CICA=CCTC=T=5^SGATGAiM?TT5CCC5lCT? 

2410 2420 2430 2440 

gagccctgactgcagcacacagagccgtgagcctca?1JLgaagcI1cattttggg^^ 

24.70 2480 2490 2Soo 

gcttcagggtggctgattccacagctgttgggttcaI"2ggaagc?gggagcactca^ 

TCAGAT"GGAAGCTTl^ACCA"|?rrAGCA§"STTCAGG"SGAACTTy?? 
2590 2600 2fil0 '^^^n 

CTGA=C=GA=AST=eGTACCACAITrrTATAS=AT=«IIcACAGA=«A=c=G=:=?laA 

CTAAGcll??=AACGAC«ATCATAT?S§ACCTAAc"STTAA<rr^l«CCTTAl«^ 

2710 2720 2730 5-7An 

C==T=ACT=CS=ATTASTCA=STCAec«CTTTTCGl^?TTT=AAT^CTTTC«olS 
2770 2780 2790 ^ann 

CA=GTCCCTTCCTGGACTTT«CT«GAATTTA=AAlS;;AA=«Ai?GLGA=ST«l? 
AA=.rxl|2ScA0TCAl?JS«===A|g?ATT=A^||gx„^^^ 

TO=GATAAGA=TAT=TC=:TCTCTTACACCCCTTACTiS?AACAGAlJiJ=ACGAG«J? 

CTGAAG!II2AG=ACA«I^AGGT??;SGGG=A=II2LcATGAil??GGTTTTGA?? 

"10 «J0 ,030 30.0 3050 3060 
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CTGCCCGCAAGGTGGATCTTGTTCCTGAGAAGGCTGGACTCAGGAAACTTCC7CTTAACA 

3070 3080 3090 3100 3110 di^q 

ASrTAGTTGATGCCGCTGGTCCTTAGTCACCGATACTGTCACGCTCTCAGCTCTTSCGCC 

3130 3140 3150 3160 317C 3180 

AGACTTGGCCGCCATCCGAGTGTGGTCACTTGCCCCCTCCCCTTCTTCCAGGAGGTACTG 

3190 3200 3210 3220 3230 3240 

GGGAAAATGGTTGGATTTGTCGAGTTGTAGCCAACACTCATGGCTCCCTTCACrrAGTAG 

3250 3260 3270 3280 3290 3300 

G7CAGCTAACATATGTGTATCCACCCCATACCGTGTGCCATGTCCAGTGCTGAGCAGCAG 

3310 3320 3330 3340 3350 3360 

GGAGTCAGAGATTTAAAGACACACACACAGACTTCAACTCTGACAATTTTGAATCCCAGG 

3370 3380 3390 3400 3410 3420 

GAGAACCGCTGAGAGCCATGGCGCTTCTACCAATGCCAGAGGCTAACACCCGGACTGAGA 

3430 3440 3450 3460 3470 3480 

AAACTAAGCACGAGGACACACCAGGGTCAGCAGAGGGCCTGGGAGCTACGGCCCTGAGCA 

3490 3500 3510 3S20 3530 3540 

GTACCTAGTTCAAATCACAGAGTCGTCTTTCTTCCTCCACCCTACCCAGGTACAGCAAGT 

3550 3560 3570 3580 3590 3fioo 

AGACACCGGTGGGCGCACGGCAGGGATGCAGGAACATTAGGGCACACCGATGTGCCTAGG 

3610 3620 3630 3640 3650 "ificn 

CTAAGCTAGAGCATGTTACCTTCTCAGGGGTCCTGTCATGTCAGAGACTGGTTCCAACCT 

^^8° 3690 3700 3710 3720 

Gv^AAAGATGTCTGAGTGACAGCTGTGGTAGAAGAAGAGAGGCCAGGCTGATATCAGCATG 

3730 3740 3750 3760 3770 -t-jan 

AAGGGCTGGATTGCTATGTGAGATCCyVGATCTCTTCTGCCACTGGGGTCAGCTTCTACAC 

3790 3800 3810 3820 3830 inAO 

TGGAAATAGATGGGCTGCGTTATGGAGGGTGGTGTGAGTCCCTGTCTGCGTTGTCCCCGG 

3850 3860 3870 3880 3890 lonn 

AATCAGAGCAGAGTGTTAGCGCTCTAAAAGGACATGCTGGTGCTTGCAGGAAATCATCGA 

3910 3920 3930 3940 3050 

TTTCTTGCAAGGGCAGCCATTCATCTACACCAGGGATTGACrrrTATGCCAGGCTTGTG^ 

3970 3980 3990 4OOO 4010 jLn->n 

GAGGGTAGAAAAGTAGAAATTCTGTCCGCTGCAAGGAGCAGTCAGAGGACACAAGG^CCA 

4030 4040 4050 4060 4070 inan 

AATAGCTTGGGAGTTGCGGAAGTAGGTGTCTGCTGAGGGAGCAGTGACCACTGGGGGAAA 

4090 4100 4110 4120 4130 4140 
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GGCTCCTTCAAGGAATTCAGCGACAGGGGTGAGGGCTGACATCTTGCCrGAGACCCTAAA 

4150 4160 4170 4180 4150 4200 

GAASAGAAGGAGTTGAGAGGGCTGAGTATGCTGTGTCGAGCCCCACCCrrACrrrCACrr 

4210 4220 4230 4240 42=0 4260 

CCACCCCCACCCCAGGTATATCGATCGAGGATAATGCGCCGGTCGGCrTCCTCTCAAATC 

4270 4280 4290 4300 4310 4320 

CATCATCCCACCrrCGAGCTGCTGGCACGGCCTTGCCACCACACCCCGATTCTGTGTTGA 

4330 4340 4350 4360 4370 4380 

CAAAATACTCGAACGAAATGATTACATG CAAATAA A aT GCAAGAGSAAAAATCT A & & CSG 

Polyadenylation site 

^ftTTC 

clone linlcer 
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o t.e t.o a.o ••^ ''^•^ ^^'^ ^^'^ i>.o» 

X a * 



^5 A 



HExon s«^«ne« Olntron 9«qu«ne« ^Opcn reading fram* 
•^NentransUUd region 
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Fig. 6 Coaparisoa of RFLX2«*6 D dueed Amino Xcid Seque&ca and 
Rat FLA2 Type I Aaifio Acid Seqiienea 



1 MetAspLeuLcuValSerSerGlyMe-tLysGlyllftAlaValPheLcuValPhellePhe 

I I I 5 5 : : : | 

1 MetLysLeuLe\il»euLeuAlaAlaLeu LeuThrAla GlyVal Thr 

2 1 CysTrpThrThrSerThrLeuSerSerPheTrpGlnPheGlnArgMetVa ILys His 

II • III I : I 

16 AlaHisSerllcSerThrArgAlaVal TrpGlnPheArgAsnMetlleLysCysThr 

4 0 IleThrGlyArgSerAlaPhePheSertyrlVriSlyTyrGlycys^^ 

3 5 IleProGlySerAspProLexiArgGluTyrAsnAsnT^TrciyCysTy 

60 GlyArgGlylleProValAspAlaTtirAspArgCysCysTrpAlaHisAspCysCysTyr 

I I III I I I I : I I II 

55 GlySerGlyThrProValAspAspLeuAspArgCysCysGlnThrHisAspHisCysTyr 

8 0 HisLysLeuLysGluTyrGly CysGlnProlleLeu AsnAlaTyr Gin 

' I : : I I : 

75 AsnGlnAlaLyfiLysLeuGluSerCysLysPheLeuIlc^spAsnProTyrThrAsnThr 

9 6 PheAla HeValAsnGlyThrValThrCysGlyCysTHrMetGlyGlyGlyCysLeu 
95 TyrSerTyrLysCysSerGlyAsnVallleThr CysSerAspLysAsnAsnAsp 

115 CysGlyGlnLysAlaCysGluCysAspLysLeuSerValTyrCysPheLysGluAsnLeu 

I III:: II 

113 CysGluSerPhelleCysAsnCysAspArgGlnAlaAlalleCysPhe SerLysVal 



135 



AlaThrTyrGluLysThrPheLysGlnLeuPheProThrArgProGlnCysGlyArgAsp 
2 Pro TyrAsnl^ysGluTyrLysAspLeu AsprirLys 



155 LysLeuHisCys 

I I i 

144 Lys HisCys 

Matches « 56 Misnarches « 84 Unmatched « 24 

Length « 164 Matches /length » 34.1 percent 

Top line is RPLA2-8 deduced amino acid sequence; bottom line is rat 
type I P1A2 amino acid sequence, a vertical line indicates a match, 
a conservative substitution, and no symbols a mismatch. 



Top line is SEQ ID NO:22:; 
Bottom line is SEQ ID NO:34; 
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Fig. 9 CemparisoB et tit* RFULS-S Deduced Amino Acid Seqaieaee 
and Rat Type Z2 Aaiao xeid Sequence 



1 MetAspLeuLeuValSerSerClyMetLysGlyllcAlaValPhcLeuValPhellePhe 

I 5 I •• : : I I- : : | 

1 MetLysValLeuLeu Leu LeuAlaVal VallleMe-rAlaPhe 

21 CysTrpThrThrSerThrLeuSerScrPheTrpGlnPheGlnArgMetvalLysHisIle 

I I I : 

15 Gly SerlleClnValGlnGlySerLeuLeuGlxiPheGlyGlnMetlleLeuPheLys 

<l ThrGly ArgSerAlaPhePheSerTyr TyrGlyTyrGlyCysTyrCysGlyLeu 

II i * i i : III II: 
34 ThrGlyLysArgAlaAspVal SerTyrGlyPhe TyrGlyCysHisCysGlyVal 

59 GlyGlyArgGlylleProValAspAlaThrAspArgCysCysTrpAlaHisAspCysCys 

i I ' I I I I I I II : I I I I 

52 GlyGlyArgGlySerProLysAspAlaThrAspTrpCysCysValThrHisAspcysCys 

7 9 TyrHisLysLeuLy sGluTyrGlycysGlnProI leLeuAsnAlaTyrG InPheAlal 1 e 

1*1 II : i I : 

7 2 Tyr AsnArgLcucluLysArgG lyCysGlyThrLysPheValThrTyrLy sPhcSerTyr 

99 ValAsnGlyThrValThrCysGlyCysThrMetGlyGlyGlycysLeuCysGlyClnLys 

I : : I I I 

52 ArgGlyGlyGlnlleSerCysSerThrAsn GlnAspSerCysArg LysGlnLeu 

119 AlaCysGlucysAspLysLeuSerValTyrCysPheLysGluAsnLeuAlaThrTyrGlu 
'III- ■ ' 



110 



CysGlnCysAspLysAlaAlaAlaGluCysPheAlaArgAsnLysLysSerTyrS 



er 



13 9 LysThrPheLysGlnLeuPheProThrArgProGlnCys GlyArgAspLysLeuHis 

12s LeuLysTyr GlnPheTyrProAsnLys PheCysLysclyLysThrPro Ser 

1=3 cys 

I 

1-S6 Cys 

Matches « 56 Mismatches ■ 87 Unmatched - I8 

Length » 161 • Matches/ length « 34.8 percent 

Top line is RPLA2-8 deduced amino acid sequence; bottom line is rat 
type II amino acid sequence. 1 indicates match, : a consevative 
substitution and no symbol, a mismatch. 



Top line is SEQ ID NO: 22:; 
Bottom line is SEQ ID NO: 35:. 
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3- RACE-RT-PCR 



»"™^A AAAAAAAAA 

^ cONA synthesis wrth {dT)17 

cDNA umiiiii 

^ First round PCR 

H10-A — ► I ** 
1 Second round PCR 



H10-1 



I 



1 1 1 1 1 n 1 1 i ^adaptor sequence 
^ Adaptor 



Determination of DNA sequence. Synthesis of primer HIO-C 

* 

5* RACE-RT-PCR 

AAAAAA 1 . miimii 

L cONA synthesis with (tfT) 1 7 and dA tailing 
■ r wrth TdT 

AAAAAA 



taptor X Ft 



HiO-C 



Anchors-adaptor 1 First round PCR 



adaptor sequence^AAAAA 

Adaptor ^ , ^ — HlO-la 

Second round PCR 



I 



Determination of ONA sequertce 



FIG. 10 
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Fig. IX XPIA2-10 eOlA ud Deriv d Amis Acid saffuenea 

(1/3) 

10 20 30 40 50 60 

S&&ZZ£££S76GATGGAGGGSGCTGAGCAGGAT6TTSACTGGCTATCSTTCATTGAGCAC 
Clone linker 

70 80 90 100 110 120 

TCTCACGATCAGCATCACGCACGGAATCCJITCCTTCCTGTGTTGCAGCTTGTAGACCCTG 

130 140 150 160 170 180 

ATGCTTGGGCTGCCAGCATAAACGTGGGCATCCAGACTCTGTCTACCGAGGCTGCCCATA 

aaasSSSSHrteeaggccrtgtccta-tgggcagcagcctcggtiagacagagt. . . . 
Clone linker 

"0 200 210 220 230 240 

GGGACAGGCCCTGGGAAGAGGAGCTGAGACCAGGCTAAAAAGAA CCCAAGAAATeAACgt; 

MetLysAr 

250 260 270 280 290 300 

CCTCCTCA?GgTgGCTTGCTTCCTCGCTT6CAGTGTGCeTGCAGTC CCAGCGGGgTTGCT 

gLeuLeuThrLexiAlaTrpPheLexaAlaCysSerValProAlaValProGlyGlyLeuL 
RclolO-5 ' (primer) 

320 330 340 350 360 

AGAACTgftft GTCCATGATTGAGAAGGTGACTGGGAAGAATGCCGTAAACAACTATGGCTT 

RclolO-l^(|?S^P^*^^"^^*^*^^**^°^^^^*^'^^*^*^^y 

370 380 390 400 410 420 

CTACGCCTGCTACTGTGGCTGCGGCGGCCAeCGG ACCeCTAASGATf;f:rAr7'f^ it^' n. rifl>rr>. 
eTyrGlyCysTyrCysClyTrpGlyGlyHisGly Th?Pr4£?sX£pGlyThrASpT ^^ 

Rcloio-2 (primer) 
440 450 460 470 480 

CTGTCGGATGCACGACCGTTGTTATGGGCTACTGGAGGAGAAACACTGTGCCATCCGGAC 
sCysArgMetHisAspArgcysTyrGlyLexil.euGluGluLysHisCysAlaIleArgTh 

490 500 510 520 530 540 

CCAGTCCTATGACTACAGATTCACACAGGACTTAGTCATCTGCGAACACGACTCCTTCTG 
rGlnSerTyrAspTyrArgPheThrGlnAspLeuVallleCysGluHisAspSerPheCy 

550 560 570 580 590 600 

^=^^?,'^?^^'^'^^^°'^°^°^°*'*='='^°^«5CTGGTCTACTGCCTGAGGAGAAACCTCTG 
sProValArgLeuCysAlacysAspArgLysLeuValTyrCysLeuArgArgAsnLeuTr 

SIO ^ 620 630 640 650 660 

GAGTTACAACCGTCTTTACCAGTATTACCeCAAgTTrrT' CTGgTAATffTrr^r ^fi'pn/^r.r- 

pSerTyrAsnArgLeuTyrGlnTyrTyrProAsnPheLeuCysEnd ^^^^^^^ 

RclolO-3' (primer) 
€80 690 700 710 720 

TCTCGCCGGGAGTGCCTCCCACAGTGGCGGCCCCCCTCGGCTGTATTCCTGATCCGTCCA 

730 740 750 760 770 780 

CCCAACGTCTTGGATCTGCCTTCCTCTGTGTACCACTGGGCTGCACAGAGCCCAGGCTTA 

''^O 800 810 820 830 840 

CACCCTACCCTCCAGAATCCTAGAGAGGGACTCTGATGTAGAGTCTGCGGACTCTGGATA 

RPLA 2 -10 cDNA sequence corresponds t.o SEQ ID NO: 29: and Derived 
Amino Acid sequence corresponds to SEQ ID NO: 30:. 
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850 860 870 880 890 900 

GCTGACCCTGCACTTGCAGAATTTGGCGCTGCGCCCCGGAGCTCCCTCAGCTCCAGGCCA 

910 920 930 940 950 960 

GTCTCGTGTTGACTTTCCTTTCAATTTCTGGAACCCAACTGCCATTACCACCCTCCAGAG 

970 980 990 1000 1010 1020 

ACCTCTTACTAGAGGAGAAGCCAAArrAACTCTATAAATCTGCCATGTAGCTATTAAATA 

1030 1040 1050 1060 1070 1080 

AAACCCATTCACGAGGCGAGAAGAACACCACCCCAGCACTCCCTCTGACAGGGCTGGGGT 

1090 1100 1110 1120 1130 1140 

AGGAGTGCCAATGCTTCTCTAACCCCTGAGGCATCTGTGCACCCTCTAGGATGGAGGTCA 

1150 1160 1170 1180 1190 1200 

GGAAACAGGTGGCGGCCTTACATGCCTTTCATG Gl TrTGTCTTGAGTrTAI I I TCTT AAAC 

1210 1220 1230 1240 1250 1260 

CTTAGGGTCTTTCAAGCCAGACCTGGAGCTCAAG A TT Ca 'TCTGGAGGAAGGTGAGACACA 

1270 1280 1290 1300 1310 1320 

GCCCTATGCCACCTTGAGCTCCAGGCTAGAAAGGGACAGCCCCTAGCCCTGGCTTCTGCA 

1330 1340 1350 1360 1370 1380 

ACTGTGTGGTCTTGAACCTCCGTATAGTCCGAATCCCTCTGGCTCTCCTCAAAATATAAA 

1390 1400 1410 1420 1430 1440 

ACAAGCCTCCTTCCAATAGCATATTGGTGCACACCCCTAATCCCATCACCTGGGAGGAGG 

1450 1460 1470 1480 1490 1500 

AGGCGGCAGGAGCATCAGGAGTTCAAGGCCAGCTCCTGCCCCCTAGCAGGGATGGTAGGC 

1510 1520 1530 1540 1550 1560 

TGCATGAGAGTGTGTCTCAGAAAGAACCACCTGGTGCGGGTACAGGGATGCTGGGATTCT 

1570 1580 1590 1600 1610 1620 

GAGATGTCACTCAGTGCGGGAAAAGATTCAAGGAGGGGAACAGATCAATGGCAGAATGAC 

1630 1640 1650 1660 1670 1680 

TGTCTGTGCCGAGTTAAGGGCACTGAAAATCTCAGCTCATCTATCGCTTTATAGAAGATA 

Ifi^O 1700 1710 1720 1730 1740 

GAGCTTTGGGAGGAAGCAAGGCACTCTACASa^AAGGAGTGGCCIT^ 

Polyadenylation site 
1750 1760 1770 1780 1790 1800 

TAGGCTCCTTCTTCTCCAGAACATCCACAGGACATAGGAGATCCATTATTTAGAGACCTT 

1810 

TCGTGTTCGAACG'ri'TTCT CCGGAATTC RPLA2 -10-1 

Clone linker 

aaataaacrttaartatatitaa gecQaaatrtc RPLA2 - 10 

Additional Polyadenylation site. Clone linker 
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The top sequence coaes froa RP1A2-10-1. The bottom sequence 

is from RPIA2-10-2. Both the sequences are identical except for the 5 

and 3" sequences indicated by the lower case letters. 
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Fig. 12 HPIA2-10 cDHA (Type IV) and Derived Amino Acid 
Sequenc 



10 20 30 40 50 60 

GGATACCAATGTTCCGACTGGAGACGGGGAGCCC6CGAGACCCGGGTCTCCAGGGTCTGC 
^0 80 90 100 110 120 

CCAAGGAAGTTGCTCATGGGAGCAGACCCCTAGAGCAGGATTTGAGGCCAGGCCAAAGAG 
130 140 150 160 170 lao 

AACCCCAGAGATGAAASG CCTCCTCCC A CTGGCTTGGTTCCTGGCrTGTA CTf;Tr;rrTf=r> 

MetLysGlyLeuLeuProLeuAlaTrpPheLeuAlaCysSerValo-oA"' 
HclolO-5' (primer) HclSlO-A (primer) 

Clone HPLA2-10-5 CCTCC. ... 

190 200 210 220 230 



220 230 94n 

^v^?^^^^?^^^^^^^^^^^^^G^TCgAGAAGGTGACAGGGAAGAACGC 
aValGlnGlyGlyLeuLeuAspLeuLysSerMetUeGluLysValThrGlvLvsAs^il 
Cloio-i (primer) clone HPLA2-io.7-~-a.ACG^ 

250 260 270 280 290 -ann 

CCTGAOIAACTACGGCTTCTACGGCTGTTACTGCGGCTGGGGCGGCCGAGGAACCCCCAA 
aLeuThrAsnTyrGlyPheTyrGlyCysTyrCysGlyrrpGlyGlyArgSl^EFifi^ 

_?20 330_ 340 350 360 



ACGC 
300 
CCCAA 




550 560 570 580 ^^^°llo'' ^^""^ni 

qcCIAGGCCTCCCCAGCGAGCTCCTCCCAGACCAAGACTTTTGTTCTGTTTO 
HclolO-3' (primer) 

610 620 630 640 650 ccn 

CACAGAGTACTGACTCTGCCTGGTTCCTGAGA6AGGCTCCTAAGTCACAGAC(nrCAGlCT 
^'0 680 690 700 7in non 

TTCTCGAAGCTTGGCGGACCCCCAGGGCCACACTGTACOTCCAGCGAGTC 

TGACTCTGGTCATAGGACTTGGTAGGG^IJcAGGGTC^CTAGGCCTcJI^ 
arrrrr-rZ^J/-H.^^r..,B?£ 820 830 840 



'^w oug 810 820 R-iO oin 

AGCCCCTCTGGTGCCAAGAGCTCTCCTCCAACTCAGGGTTGGCTGTGTCTCTTTTCT?CT 

860 870 880 fi90 onn 

ctgaagacagcgtcctggctccagttggaacactttcctgagatg(:acttacttctJ^ 

TTCTGCGATCAGATTATCATCACCACcl??CTC(::AGAGi^ 

^80 990 1000 1010 

ATTGACTCTCTAAATCTGGTGTATGGGTATTAAATAAAATTCATTCT 

Polyadenylation site 

AATAAA 

AACCACATTGGCATTTTC HPLA2- 1 0-3 Addi t i ona 1 

Polyadenylation sire 

HPLA2-10 cDNA sequence corresponds to SEQ ID NO: 31: and Derived 
Axnino Acid Sequence corresponds to SEQ ID NO: 32:. 
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Fig. 13 Comparison Bctveen' Deduced Amino Xeid Sequences of 
E7IJ^2"*10 and RPLA2*10 



1 MetLysGlyLe\iLeuProLeuAlaTrpPheLeuXlaCysSerValProAlaValGlnCly 

II 11 i I I I I I I I I I I I I 

1 MetLysArgLeuLeuThrLetiXlaTrpPheLcuXlaCysSerValProAlaValProGly 

2 1 G ly LeuLeiiAspLeuLy sSerMetl leG IxiLy s ValThr G 1 y Ly s As nAl aLeuThr As n 

i I I 1 I I i I I I I I I I I i : I 

21 GlyLetiLeuGliiLciiLysSerMetlleGlxiLysValTlirGlyLysAsiiAlaValLysAsn 

41 TyrGlyPheTyrClyCysTyrCysGlyTrpGlyGlyArgGlyThrProLysAspGlyThr 

I I I I I I 1 I I I I I : I I I I I 1 1 
41 TyrGlyPheTyrGlyCysTyrCysGlyTrpGlyGlyHisGlyThrProLysAspGlyThr 

61 AspTrpCysCysTrpAlaHisAspHisCysTyrGlyArgLeuGluGluLysClyCysAsn 

I i i i I I : i I I I I i I I 

6 i AspTrpCy scy s Ar gMe^HisAspArgCysTyrG ly LeuLeuG luG 1 uLysHis Cy s Al a 

8 1 IleArgThrGlnScrTyrl-ysTyrArgPheAlaTrpGlyValValThrCysGluProCly 

I i I I i I III: : I It 

a 1 IleArgThrClnSerTyrAspTyrArgPhcThrGlnAspLeuValllecysGluHisAsp 

101 ProPheCysHis ValAsnl*auCys AlaCysAspArgl^ycLeuValTyrCy sl^uLvsAra 

11 I M I I I I I I I I I I J T I 

101 SerPhcCysProValArgLeuCysAlaCysAspArgLysLeuValTyrCysLeuArgArg 

121 AsnLeuArgSerTyrAsnProGlnTyrClnTyrPheProAsnlleLeuCysSer 

i I III I I I : I I II 

121 AsnLcuTrpSerTyrAsnArgLeuTyrG InTyrTyrPr oAsnPheLeuCys 

Marches = 107 Mismatches » 30 Unmatched - i 

Length = 138 Matches /length « 77.5 percent 



Top and bottom lines are deduced amino acid sequences of HPIA2-10 and 
RPLA2 • 1 0 , respectively . 



Top line is SEQ ID NO: 32:; 
Bottom line is SEQ ID NO; 30:. 
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Fi?* 14 Cosparisoa B«tvettzi Deduced Xai&o Acid Secpjenee 

and Riiaan Typa I Aaiao Acid 8«quaae« 

1 Me^LysGlyLeuLeuProLeuAlaTrpPheLeuAlaCysSerValProAlaValGln 
I i I I II I : 

1 HeU.ys LeuLeuVall^iiAlaValLeuLeuThrValAlaAlaAlaAspSerGlylle 

20 GlyGly Leul*eu AspLeuLysSerMetllcGlu LysValThrClyLysAsn 

• 2 I I : I 

2 0 SerProArgAlaValTrpGlnPheArgLysMetlleLysCysVallleProGlySerAsp 

3 7 AlaLeuThrAsnTyrGlyPheTyrGlyCysTyrCysGlyTrpGlyGlyArgGlyThrPro 

4 0 ProPhel^uGluTyrAcnAsnTyrGlyCysTyrCysGlyLeuciyGiySerGiyTh^ 

57 LysAspClyTiirAspTrpCysCycTrpAlaHisAspHisCysTyrGlyArgLeuGluGlu 

> I II : I I 11 

60 ValAspGltU-exiAspLysCysCysGlnThrHisAspAsnCysTyrAspGlnAlaLysLys 

77 LysClyCysAsn IleArgThrGlnSeTTyrLysTyxArgPheAlaTrp 

I • 5 I : : : 

8 0 LeuAspSerCysLysPhcLeiiLeuAspAsnProTyrThrHisThrTyrSerTyrSerCys 

5 3 GlyVal ValThrCysGluProGlyProPhc CysHisValAsnLeuCysAla 

I -11 I : I 

100 SerGlySerAlalleThrCysSerSerLysAsnLysGluCysGluAlaPhelleCysAsn 

110 CysAspArgLysLeuValTyr CysLeuLy sArgAsnLeuArgSerryrAsnProG InTyr 

III t : II 

12 0 CysAspArgAsnAlaAlalleCysPheScrLysAla ProTyrAsnLysAlaHis 

13 0 G InTyr PheProAsnllftLeu Cys Ser 
13 8 LysAsnLeiiAspThrLysLysTyrCysGlnsir 

Marches « 45 Mismatches « 90 Unmatched » 16 

Length = 151 Matches/ length - 29.8 percent 

Top line is HPLA2-10 deduced amino acid sequence; bottom line is 
human type I amino acid sequence. 



Top line is SEQ ID NO: 32:; 
Bottom line is SEQ ID NO:36:. 
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Fig. X5 Coap&rison Be^vMn EFUIS^XO Deduced Aaino Acid fiequence 
aad Eumaa fza2 Type XI Xaino Xeid Sequence 



1 Hetl-ysGlyl^uI-euProLeuAlaTrpPheLeuAlaCysSerValProAlaValcinSH- 
i 1 I I I I : : r 

1 Me^ysThrLeul^uLexiI^euAlaVallleMctllePheGlyLeuLeuGlnAlaHisGly 

21 SlyLeuLeuAspLeuLysSerMctlleGluLysValThrGlyLysAsnAlaLeu-^-Asn 
' • s I I III!**' 

21 ^«»l^uValAsnPheHisArgMetIleLysLeuThrriirGlyLysGluAlaAlaLeuSer 

41 ^^jyP^^^^jyCysTyrCysGlyr^ 

41 ^GlyPheTyrGlyCysHisCysGlyValGlyGiyArgciySMpioLy^^ 

AspTrpCysCysTrpAlaHisAspHisCysTyrGlyArgLeuGluGluLysGlycysAsn 

I II =11 II III • I I 

AspArgCysCysValThrHisAspCysCysTyrLysArgLeuGluLysArgGlyCysGly 

8 1 IleArgrnrGlnSertyrLysTyrArgPheAlaTrpGlyVal ValThrCysGluPro 

8 1 ThrLysPhcLexiSerTyrLysPheSerAsnS« GlySerArglieThrCysAlaLys 

100 GlyJ*3roPheCysHisValAsriLeuCysAlacysAspA^ 



61 
61 



100 
120 
120 



GLnAspSerCysArgSerGlnLeuCysGluCysAspLysAlaAlaAlaThrCysPheAla 

ArgAsnLeuArgSerTyrAsnProGlnTyrGlnTyrPheProAsnl ser 

' ' * I i I i I : I 1 

ArgAs nl-ysThrThrTyr AsnLysLysTyr G InTyrTyr Ser AsnLy sHisCys ArgG 1 y 



14 0 SerThrPr oArgCys 

Matches « 63 Mismatches - 74 Unmatched « 8 

Length - 145 Matches /lengtih « 43.4 percent 

P?S2^4C^«^?T"ri?J:^L?5'*''"^ "^'^'^ ''^^^ sequence; bottom line is human 
^1^2 type II amxno acxd sequence* 

Top line is SEQ ID NO:32:; 
Bottom line is SEQ ID NO: 37:. 
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Tig, 16 . CoaparisoB Be'tvaeo Deducsd Xaiae Acid Saquenees of 
RPLX2-10 and Rat Typm XI Xaiae xeid fiequesee 



a MetLysArgLcuLeuThrLcuAiaTrpPheLeuAlaCys SerValProAlaValPro 

I I I I i t : I 

1 MetLysValLcul^ul^uLauAlaValVallleMetAlaPheGlySerlieClnValcin 

20 GlyClyl'ful'euGlul^uLysSerMetlleGlul.ysValThrGlyLysAsnAlaValLys 

I'll I i I I I I 

21 ClyScrLeuLeuGluPheClyClnMetlleLeuPheLysTiirGlyLysArgAlaAspVal 

4 0 AsnTyrGlyPheTyrGlyCysTyrCycclyrrpClyGlyHisGlyThrProLysAspGly 

I i I I I I II I i : I : I I i 

4 1 SerryrGlyPheTyrGlyCysHisCysGlyValGlyClyArgGlyserProLysAspAla 

60 TteAspTrpCysCysArgMetHisAspArgCysTyrGlyLeuLeuGluGluLysHisCys 

61 ThrAspTrpcysCysValThrHlfiAipCysCyslVrAsnArgLiuGiuLysA^gGlycis 

80 AlaIleArgThrGlnScrTyrA«pTyrArgPheThrGlnAspLeuVallleCysGlu 

8 1 Oiyri^-I-ysPheValThrTyrLysPheSerTyrArgGlyGlyGlnlieSercisSerTtir 
99 His^spSerPheCysProValArgLeuCysAlacysAspArgLysLeuValTyrCysLeu 
101 ^n^SinAspSerCysArgLysGlnLeuCysGlncisAipLysAlaAlaAlaGluCysPhe 
119 ArgArgAsnLeuTrpSerryrAsnArgLeuTyrGlnTyrryrProAsn Phe 

121 AlaArgAsnLysLysScrTyrSerLeiiLyslVrGinPheTyrpi-oAinLysPheCysLys 

13 6 Leu Cys 

14 1 GlyLysThrProSerCys 

Matches « 62 Mismatches - 75 Unaatched - 9 

Length = 146 Matches /length - 42.5 percent 

Top line is RPLA2-10 deduced aaino acid sequence; bottom line is 
rat PLA2 type II amino acid sequence. oa-^i^am xine is 



Top line is SEQ ID NO: 30:; 
Bottom line is SEQ ID NO: 35:. 
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Fig. 17. comparison B««v*«n DeAuead Amiae Xcid Sequences of 
RPLA2-10 and R?ZA2»S 



1 MetLysArgLeuLeuThr Leu Ala Trp PheLeuAla 

I Is: : : | : 

1 MetAspLeuLeuValSer SerGlyMetLysG ly 1 1 eAlaVa IPIicLeu Va IPh e 1 1 ePhe 

13 Cys ScrValProAlaValProGlyciyLeul^uCluLeuLysSerMetlleGluLys 

21 cysTrpThrThrScrThrLiu SerSerPheTrpGlnPheGlnArgMitviiLysHis 



32 
40 
51 
59 
71 
79 



ValThrGlyLysAsnAlaValLysAsnTyrClyPhe TyrGlyCysTyrCysGlyTrp 

:||: I I • llllll 

neTliTGlyArgSerAlaPhePheSerryr TyxClyTyrGlyCysTyrCysGlyLeu 

GlyGlyHisGlyThrProLysAspGlyThrAspTrpCysCysArgMetHisAspArgcys 

' ' • ' I I II II 111 

ClyGlyArgGlylleProValAspAlaThrAspArgCyscysTrpAlaHisAspCysci's 

TyrGlyLeuLeuGluGluLysHisCysAlaZlcArgTttrGlnSerTyrAspTyrArgPhe 
TyrHisLysLeuLysGluTyrGlyCysGlnProIleLeuAinAlaTyrClnPheAlalle 
91 ThrGlnAspLeuVallleCysGlu His Aspser PheCysProValArg 

ValAsnGlyThrValThrCysGlycysThTMetGlyGlyGlyCysLeuCysClyGlnLys 
LeuCysAlaCysAspArgLysLeuValTyrCysLeuArgArgAsnLeuTrpSerTyrAsn 
AlaCysCluCysAspLysLeuSerValTyxCysPheLysGluAinLiuAlaThrTyrGlu 
127 ArgLeuTyr GlnTyrTyrProAsnPhe Leu cys 

1 3 9 LysTiirPheLysGlnLeuPheProThTArgProGinCysGlyArgAspLysLiuHiscis 

Matches « 48 Mismatches - 87 Unmatched « 25 

Length - 160 Matches/ length ■ 30.0 percent 

Top line is RPLA2-10 deduced amino acid seguenoe; bottom lin- i« 
RPLA2-8 deduced amino acid sequence. nottoin ime xs 



99 

107 

119 



Top line is SEQ ID NO:30:; 
Bottom line is SEQ ID NO:22:. 
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rig. 18 Conparisoa Between Deduced Amine Acid Sefue&ee of 
RP1A2-10 end R*t VZX2 Type 1 teiao Acid Se^ueBce 



1 

1 

i9 
20 
37 
40 

57 

60 

77 

80 



MetLysArgLeuL«uThrI.euAl«TrpPhel.euAlaCysSerVal Pro AlaVal 
MetLys LeuLeuLeuLeuAlaAlaLeuLiuThrAlaClyvilThrAlaHisSerl le 
ProCly GlyLeuLeuCluLeuLysSerMetlleClu LysValThrGlyLysAsn 
SerThrArgAlav;iTrpGlnPheA;gAsnMetIleLysCysThrlieProGiySerAsp 
AlaValLysAsnTyrClyPheTyrGlycysTyrCysGlyrrpGlyG 

ProLeuArgGluTyrAsrw^nTyrGlycisTircisclyLeuGiyclyserclyrirpio 

LysAspGlyThrAspTrpCysCysArgMetHisAspArgCysTyrClyLeuLeuGluGlu 

ValAspAspLeuAspArgCysCysClnThrHisAipHlscisTirAsnClnAlaLySLys 
LysHisCysAla IleArgThrGlnSerTyr Asp TyrArgPhe 

LeuGluSerCysLysPheliuIleAspA^nProT^AsnThrr^SerT^sCys 

ThTGlnAspLeuVallieCys CluHisAspSerPhecysProValArgLeuCysAla 
SerGlyAsnVallleThrCysSerAipLysAsnAsnAspcisGluSerPhelieclsAsn 
CysAspArgLysI^uValTyrCysLeuArgArgAsnLeuTrpSerryrAsnArgLeuTyr 
CysAspArgGlr^laAlalleCysPheScrLys vll Pro^rAiraysGluT^ 
- ^ ° G InTyrryr ProAsnPhel<euCys 

13 8 LysAspLeuAspThrLysLysHisCys 

Top line is SEQ ID NOi30r; , 
Bottom line is SEQ ID NO: 34:. 



91 

100 

110 
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Fig. 19 Human Senomic HPLAt-S Sequence 

(1/15) 

SEQ ZD MO: 33: 



10 


20 


30 


40 


50 


- 60 


AA G CTTTG T3 


GGATTTCTAT 


TATGAACAAC 


ATAGGTGCCT 


TTCCAACTCG 


GGAACAGAGG 


70 


80 


90 


100 


110 


120 


AAATATGGAC 


TCCTCAAAAG 


AAAAAAAGAA 


GAGATGAAGG 


GATGATGTTG 


CCAAAGAAAG 




14 U 


ISO 


160 


170 


180 


AAATTTGGAA 


AAAAAAAAAC 


CAAACCAACA 


TTTGCACTTT 


CAAAACCATG 


GAACCCTTCT 


190 


200 


210 


220 


230 


240 


TATTTTTATA 


TGTTCAGATC 


TAAATGCCAG 


AAAGGTTACC 


ACATTCAAAG 


GGAATGAGAT 


250 


260 


270 


280 


290 


300 


TTGAAAATGA 


TTTCTTTGAG 


TCCTCTGCTG 


AGGTCTTT CC 


AAGGCACTAC 


AATTAGGGCT 


3X0 


320 


330 


340 


350 


360 


TTGCACCCAA 


ATACCCTTGC 


CTCATTTTGG 


TCATTTTTGT 


CCTGGAACAG 


AGGTTCAGCT 


370 


380 


390 


400 


410 


420 


GGGAGACCCC 


TCACACACAG 


GTGAAGGCGT 


GGCTGTAGAA 


CCTCAGACCC 


C C7G GTCTCC 










£xon 


1 ? 


. 430 


440 


450 


460 


470 


480 


TCAGGAATPA 


AGGTCATTGC 




jCTCCTCCTCT 


TCTGCTGTAA 


GTAGAGAGCG 


490 


500 


510 


520 


530 


54 0 


TTGGTGGGTC 


AGCACCAAGC 


TTCTGTCTTC 


CTGTTTATGT 


CAGTGGGAGG 


GGCGACTCTC 


550 


560 


570 


580 


590 


600 


CAGGTGGCAC 


CAGGTGAGGG 


AAGTCACAAG 


TCCCCCAGAA 


AAGAATCAGG 


AAAGGAACGG 


610 


620 


630 


640 


650 


660 


GCTCCCACCA 


ACGTCCTCTT 


GCTTCTCTTT 


CTGCTATAAA 


ATGGGCTGAT 


CCCAGTGTTG 


670 


680 


690 


700 


710 


720 


GCATCTTATA 


AAGTGTCTAG 


GAAATCAGAG 


GTTGCCAACC 


ATTTGCTAGA 


AAGGGAGTTT 


730 


740 


750 


760 


770 


780 


GAGTAGTATT 


TTACCCCCCC 


TCACCCTCAA 


GAGTCTTTTT 


ACTTTGGATG 


CTAGTAGCCT 


790 


800 


810 


820 


830 


840 


TTTATTTAGG 


CATTGGATCA 


GAACAAAAAT 


GCAGGACATA 


TATCCAGCCT 


AATTTAACCA 


850 


860 


870 


880 


890 


900 


ATGGATTAAA 


TGGCCTTATC 


AGGAAAAGAC 


CATTTTATGG 


TGACTTATGG 


GATAATTGGT 


910 


920 


930 


940 


950 


960 


AGTTATAAGT 


CATTGCTGCC 


GGGAGATCCG 


ATTGCTTACC 


TCTGCAAAGT 


GAAGAAAGAC 


970 


980 


990 


1000 


1010 


1020 


CTACTGGGAA 


ACAGTTTGGG 


GTCTACTGGA 


GACTGATAGA 


CTCTTTTGCT 


GGATTCGTTC 
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FIG. 19 (2/15) 

1030 1040 1050 1060 1070 108 0 

AGTCGAGGTT TCTCCAGATC CATTTTCCTG TCTCTTTCAA TTGAGTCACA ATAACTTTTG 

1090 1100 1110 1120 1130 1140 

j:.GTCCCTAAG TCAAAGATGT CAAAAACAGA CTTCCTTTCC CCACAGTGAG TGGTGGAArT 

1150 1160 1170 1180 1190 1200 

TACACTTTGC AAGGTGATAC TGCACGACGA TACCTGTACC CACGGATGAC CGCCTCTGCA 

1210 1220 1230 1240 1250 1260 

GCCCTCAGTG CGGCTCCAGG ACTGCTTGCG CACCACTGAC CGCCCCATGG GTTTCTTCCG 

1270 1280 1290 1300 1310 1320 

CCACACCCCC GTTTAGACTG AACACGATAG GTAGATCGAA GGCCACCTGA GAAAACTCCC 

1330 1340 1350 1360 1370 1380 

CCAAAACTCT ATTTCT G TTT CTCTTCTTCA TUIGTTCATGT CTTTGTTGTA TTTTTATTGC 

1390 1400 1410 1420 1430 1440 

AAATTTACTA CATGCTTATA GTTAAAAAGT AAAATAAATG AGTA^i'ATAGC AACAAGGTAA 

1450 1460 1470 1480 1490 1500 

AGCTCCTCCT CATCCTCCCC AGACCCCAGT TTTTTCCCTA CATCCAGATG TGACCACTCT 

1510 1520 1530 1540 1550 1560 

TAAGAGTTTG ATATACATCC TCTATACAGC GTTTACCACA CACACATTCA AAACACCATA 

1570 1580 1590 1600 1610 1620 

ATAGGAAGGG AACACATCCT GGGCCGGGCG CGGTTGTTCA TGACTATAAT CCCAGCACTT 

1630 1640 1650 1660 1670 1680 

TGGGAGGCCG AGGCGGGCGG ATCACCTGAG GTCAGGAGTT CGAGACCAGC CTGGCCAGCT 

1690 1700 1710 1720 1730 1740 

GGCAACATGG TGAAACCCGT CTCTATTAAA AATACAAAAA ATTAGTCAAG CATGGCAGTT 

1750 1760 1770 1780 1790 1800 

GGGCACCTGT AATCCCACCT ACTCACGAGG CTGAGGCAGG AGAATTGCCT GAACCCGGGA 

1810 1820 1830 1840 1850 1860 

GGCGGAGGTT GCAGTGAGCC GAGATCACAC CATTGCACTC CAGCCTGGGT AACAACAGCG 

1870 1880 1890 1900 1910 1920 

AAACTCCGTC TCAAAAAAAA AAAAAAAAGA AGGAAAGGGA CACACGCTTA TTATGAAAGA 

1930 1940 1950 1960 1970 1980 

CATGAGACAG CCGAGACGTG TATAAATGAT GTTGCCTGTT TTCTTTCTCT CTCTTCATCC 

1990 2000 2010 2020 2030 2040 

ATGCTAGAGA TAGTGCTATC AAATGTACTT ATTTTTGAGA CACATATTTC GTATTATCCC 

2050 2060 2070 2080 2090 2100 

TGTCGTGACA TGTCGGTGGT TTCCAATTTT TTGATATCAC AGATAATGCT TCAGGAAACC 
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FIG. 19 (3/15) 

2120 2130 2140 2150 2160 

attttgtctJ tcsatt^ctg cccactctca taagcatctt gtagaagcaa.aaacagctga 

2170 2180 2190 22'00 2210 2220 

GTTCATGTGT ACTTGTCATT TAAAAAAATA ATAATTGAGG ATACC.ww .-^w.w.i^ 

2230 2240 2250 2260 2270 2280 

GTATTTTGTT TCTCCTGTGA GATACTAAAG GCCTGATGAC ATCTGGAGGG ACTGGCGTTT 

2290 2300 2310 2320 2330 2340 

CTGGCTTTGA ACTTTTGCCA TTCATGTTGC ATCAGACCCG AGGGTGTTCT GCCTAGAACT 

2350 2360 2370 2380 2390 2400 

^T^QQrncrT gctttgagcg ggaasactat ggttgatggc aaagccttgt tctgaacctc 

2410 2420 2430 2440 2450 2460 

atggaaactg ggtattcatc tgccttacca aaaaactagc tgtcttacag gggcaaatct 

2470 2480 2490 2500 2510 2520 

gaacctattt tattccccag gaaagagcct ggtgattcca gccatgcccc ttgcacttcg 

2530 2540 2550 2560 2570 2580 

ctttggggat ctggtgatat ttccaatgct cagcactcta gtaaggggag gggacatcaa 

2590 2600 2610 2620 2630 2640 

ggcagcatca tgctcattgc aacttccttc ttcctttttt tctcatcggt gg tggc^gcc 

2650 2660 2670 2680 2690 2700 

ggcAcccA r ^ r.g&cTTTTT^ ceAeT T-reAS aggagggtca AACACATCAC ggggcgaagt 

2710 2720 2730 2740 2750 2760 

r.r rTTCTTrr CXTATTf.C r.r, ATATSGCTCe TAOTGTGGGC TTGGGGATAA AGGGATCCCC 

Exon 2 

2770 2780 2790 2800 2810 2820 

c:TGCATGACA CTGACAG GTG ggtgcagagg ctctaaggcc acttatcatt tgttttgcat 

2830 2840 2850 2860 2870 2880 

TAAAGTTCAT GCTCAAAGCC AGAGAGACGG TCTTAGGATT CTTGCCTGGC AAATAACAGA 

2890 2900 2910 2920 2930 2940 

AAACAACTCA GGCTAATGGA AGGAAGAACT GAACGGGATT TGGAGGATGG GTCTTGAGAA 

2950 2960 2970 2980 2990 3000 

ACCCAGGGTC GGGGCCAGCT TCTTGAGTGT GTGACCTGTC AAGTTTCACA GGCCCCAACA 

3010 3020 3030 3040 30 50 3060 

CTCATAAGGG TCAGGGCCAG CTTCTTGAGC CTGTGATCTG TAAAGTTTCA CAGGGCCTGG 

3070 3080 3090 3100 3110 3120 

CACTCATAAC CCCCTAAACA TGGTXTACTG CTCTGCTGCC ACATCTTGAA ATTCTTAATA 
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FIG. 19 (4/15) 

3130 3140 3150 3160 3170 3180 

AAGCGCCTCA TGTTTTCATT TTGCTTTACT CTCTGCAATT ATGCCGTTGG TCCTGCCCAG 

3190 3200 3210 3220 3230 324 0 

AGCTCTAGAA GCTGTTTCAT CCTCATAGTA AAAGTGCTCT GCTTTCAGCT CTCCAGui.T 

3250 3260 3270 3280 3290 3300 

TAGCACTATA CCCACAGCAC AACTGACTCA CTAGTCCTAA TTCCATATTC TGGAGAGGGC 

3310 3320 3330 3340 3350 3360 

TCCAAAGTGG CCCACTTTGG AGAAGTTGTC CATCTGGGTG AGGTTGCATG GCACAAACCT 
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11050 11060 11070 

AAAGATGCAA GATCTGGACT TAAGCTTGTT 

11110 11120 11130 

CTTGGCCTGG GGCTGTCTCA TCATCCAGGT 

11170 ' 11180 • -11190 

GCCACAGAAG TGACAGTCCA CTGCAAAAGA 

11230 11240 11250 

GGCAATGGGG CAGGCCACTG AAGTAGAACT 

11290 11300 11310 

GGAAGACCAA ATGAGAAAGG GAGAGGGGGC 

11350 11360 11370 

AGGCAAAGGA AACAAGAGAT GGAATAGAAG 

11410 11420 11430 

ACAGAAAGAG AGAGGGACGA GAAAGAAGGT 

11470 11480 11490 

GGCGACAGCA ACTGGACCAG CCCCTCTAGT 

11530 11540 11550 

CTCAAGTAGC CCTCCTCCCC TGAGAGAGTG 

11590 11600 11610 

TCTGGAAGGA TCTTCACAGA GGCAGGAGGG 

11650 11660 11670 

TAGAACAGTG GGAGTGGCGG GACGCAAAAC 

11710 11720 11730 

AGATTCCCCT GCG6CCGAGC AGGAGGGCAG 
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10720 10730 10740 

ACCTTGAGTT TGAGACACGT GTCAGACATG 

10780 10790 10800 

TGAAGTCTGG AGCGAAGGGA 

10840 10850 10360 

TTCTGACACT GACCAAGAAC ACCCCTCAGA 

10900 10910 10920 

GGGCTTAGAC TGGGGCCTGG GACAGCCACA 

10960 10970 10980 

ATTTGTGGAG ATGTAGTTCA TGCAGCAAGA 

11020 11030 11040 

TGATGCCTCA GTTTCTCCAT TAGCACATTA 

11080 11090 11100 

TTTAAAAGGT CGCCTCCAAA GACGGTTTTT 

11140 11150 11160 

CATGACACGC CCGGTCCATG GTTGAGGAAT 

11200 11210 11220 

CTGCTGCTCC AGATCAGTTC TGGAAGGCCT 

11260 11270 11280 

GGATGTCAGA TGCACGCATT AGAAAGGACA 

11320 11330 11340 

AGGGAGAAAG GAAGGAGAGC TAGAGACTTG 

11380 11390 11400 

AAGACAGAGG ACCAGAAGAC AGTGAGACCA 

11440 11450 11460 

GGCTGAGGAA GGTGAGAAAA GTGTTTCCAG 

11500 11510 11520 

TGGACACTGA GGCTGGCTGG GGGGCCTGAG 

11560 11570 11580 

GGGGCTACCT GGGGAGCTGG GCTTGATGCA 

11620 11630 11640 

GGAGTGCGAG GGCAGAGGGC ACCCAGGCGC 

11680 11690 11700 

CGGAGAGCCA GAGGAGTGAA CATCCCTGGC 

11740 11750 11760 

GAAGCTCAGT CGTGTTGGCA CAACGTGAGA 
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11770 11780 11790 

AGTTCCAGGG AGGCGTGGGA GGACGGCTTC 

11830 11840 11850 

GGCAAACAGA CTGACTGCAG GCACCTCTGC 

11890 11900 11910 

GGTGGAGCTG GAGTGCATCA CCCTGAGAAC 

11950 11960 11970 

CGTGCCAGGC ACATCCGGAC CACTTGTCGG 

12010 12020 12030 

GGAATGGGAC AGTGTCATGT GTTTCTGAAA 

12070 12080 12090 

AGGAAGCTGA AGCTAACTTG CCCAACGTCA 

12130 12140 12150 

CACTGCCAAC CCTGACAGCC GACTAGGGTG 

12190 12200 12210 

AGAGCCCTAA ACACACATGC GCCCGCACCA 

12250 12260 12270 

GCACACACAT GCGCACCCAC ATACACACAC 

12310 12320 12330 

AAGCATACAC ATGCACACAG GCACACGCAT 

12370 12380 12390 

ACATGTGCAT GCACACAGTG CGACAGCTCT 

12430 12440 12450 

AGTGGTGACT CGGCCAAAGT GCAGACACTG 

12490 12500 12510 

TCCCTTCTCT TATTCTTCAT TCATGGATTC 

12550 12560 12570 

CCTGGATGCT' GGGGATACTC TGATGACTTA 

12610 12620 12630 

TTCTTTGAGG GGAGGCAGAC ACAAGCCAGG 

12670 12680 12690 

CAGTGAGTGA GACAAACGGG TACGGAGGAC 

12730 12740 12750 

CAGGGAGCAC CTCTCTGAGG AGGTGAAATT 

12790 12800 12810 

CACTGCTTGC AGATCCTGGG GAGAACCATT 
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11800 11610 11820 

TGCAGGACGC AGACTTTGCA GAGGGAGAGT 

11860 11870 11880 

CSGCTCCACA GG3CGCTGCT TTTTCTCCAC 

11920 11930 11940 

CAGCAGCAAG CCCCCACAGG GCACCTTCTG 

11980 11990 12000 

TAGACACCAG TGACCCTCAC CACCACCCCA 

12040 12050 12060 

TGACTAGGTT TTAGCACCAT TTCATAGATG 

12100 12110 12120 

TAAACCGGGC GTCTGGTGGC CTCCCCTCCT 

12160 12170 12180 

GAGTTATCTG GAAAGAGGAA GCTGTACCTG 

12220 12230 12240 

CACACACACA CGCACAAACA CACAATGCAC 

12280 12290 12300 

ATGCACACAT GGACACATAC CTGCACACAC 

12340 12350 12360 

GCACACACGC GCATGCACAC ACATGCACAC 

12400 12410 12420 

GATTAGTAGG TAAATAAAAG GTTCCCATCT 

12460 12470 12480 

AACCCCAAAG GCCCATAGAG GCTTCATTCA 

12520 12530 12540 

TATTGAGCAT CTGCTCTGTG CAGCATCTGT 

12580 12590 12600 

GACAAGGTCT CAGCCGCACA CAGCTTATGC 

12640 12650 12660 

AAACCAATAA GAGAAGTTAA GTAAAAAGCA 

12700 12710 12720 

ATGGCCAGAG AGAGCTTTAG TTCAGGTGGT 

12760 12770 12780 

TGACCAAGCC TCAAACAGTG GCAGGGATCC 

12820 12830 12840 

TTAGACAAAA AGAACAGCAA GTCCAAAGGC 
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12850 12860 12870 12880 12890 12900 

CCACAGACAA GACAGAGCAA GACCTCTGAC ATGAAACAGG CTCGTGTGCC CAGAGCAGCG 



12910 12920 12930 12940 

AGGCTGCGAG AGTCGAGGGG GAGGCCGATG AGGGTGGAGA ASC. 



12950 



12960 



* WW WAX Wta 



12970 12980 12990 13000 13010 13020 

CGGCAAGTGT GCCTGGCCAC CGAGGCCACG GAACGATTCA GCATGTCTTT CCCGAATACG 

13030 13040 13050 13060 13070 13080 

AACCACACTC GGCTGTAACA CAGACTGACC TACTCGGTAC GTTGAGAAGG TCCTGCTTAT 

13090 13100 13110 13120 13130 13140 

TTCCTTCCGT GAAGGAGGAA GACCTGCTGA TGACAGAGAT TGGCAGTGGC CAAAGACATA 

13150 13160 13170 13180 13190 13200 

GAGAGAAGAG GGCAGAACAT GGGCTATTTT AAACACAGAG AAGATTAGCG GGACCCGCTC 

13210 13220 13230 13240 13250 13260 

GCAGACCGGA CGTGAAATGT GGAAGCAGCG GCGCCACCCA GSTCGGCTCC TAGTTTCCTG 

13270 13280 13290 13300 13310 13320 

AGAATGTGGG TGAATCACCG CCTCACACCC AGACCGACCA CTAGGATATC AAGGGTTCCC 

13330 13340 13350 13360 13370 13380 

TTGTGAACGC CTCAAGTTCG AGATGCCTGA GACATCCAAG TCAGATGTCA AGCAGGCAGC 

13390 13400 13410 13420 13430 ii^an 

TGGAAATAGG AGATGACCTC TGGGAAAATG CTCCCATCAC CCTGGCCTGT GTGCTGCCTG 

13450 13460 13470 13480 13490 13500 

GGCGCACCCA TTCAGCGCCC TCCACGCAGC CCACGCCCCT GCCTCCTGAT TCCTTCTAGG 

13510 13520 13530 13540 13550 13S60 

CTTCTCCAGC ACTCGTGGGA TGCCCAGATG TGATCAGGGA AGGGCTTGAG GATGCAGGGA 

13570 13580 13590 13600 13610 T»«:>n 

AGCTGTGGCT GAGAGCCCTA AACACACACA TGCACACGCA CACACACATA CACAGGCAcJ 

13630 13640 13650 13660 13670 l-lfiao 

TGCACACACG ACCATACACA CACACAAATG CACGCAGATG CACACAAATG CATATGCATC 

13690 13700 13710 13720 13730 

CACACAAATG CATATGCACA CACACACATG CACACATATG CATACACGTA TCCCTTTCAG 

^"^50 13760 13770 13780 13790 nnnn 

TGGCTTTCCT TTCTGTCCTT AACCCTTGGC CCCTTACAGT GAGCTCCciS TTCTCCc"2 

13810 13820 13830 13840 13850 i^Bcn 

CCTTAGAACC AAACCCTGGC GCTGGGCTGG GAGCCCCCAG TGACCCTCTG TGTCTCTGtI 

13870 13880 13890 13900 13910 13920 

G GTGGATCCA CCCTTGGTCC TGCrrGggAr.e -rnr-n ^ rTaciA r^r>r"legc CTGTc"?g>r 
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13930 13940 13950 13960 13970 13930 

SACAASCAAT CggTCSgAeTC CTTgAAAGAS AgC CTGCCCA CCTATGAGAA AAACTTra A.-. 

Exon 4 



14 Z-i 0 



13990 14000 14010 14020 1403C 

CAGTrCTCCA GCCGGCCCXG GTSTGGCAGA CATAA'CTCrT GCTSCTAG CG ACA: 

14050 14060 14070 14080 14090 14100 

GTCCCTCTCA TCATCCAGCA TCCGCTCTAC TGTTGCTCTT CCAGGAAGCC TTCTCAGATC 

14110 14120 14130 14140 14150 14160 

ATCCCCAACA GGCCCCTGTT CTTCCACTGG GAGCGAGCAC AAAATGTCTC CCGCAGGGCA 

14170 14180 14190 14200 14210 14220 

GCTCACCCTT CAGCATTCTC ACCAACCGCA CTCCCTCTCC TTCAGCATCA GAGGGCTGCA 

14230 14240 14250 14260 14270 14280 

GAGCAGAAAT GGGAAAGATC AGATCCCTCC CCTGCAGGAG CTGGCATTCT GTGGAGTGGG 

14290 14300 14310 14320 14330 14340 

GAGGACTACA AATGCATGGA TATAGAAGTA AGAGACACAT TAGACTGTAG TAAGTGCTAT 

14350 14360 14370 14380 14390 14400 

GATGCAGTAA AACAAAGGGA CCCGATAGAG ATGCACCCAA CCCCACATCC CAGGCGTTTC 

14410 14420- 14430 14440 14450 14460 

CAGGAGGGGA GAAGCCCCAG GATCTACCCC AAACTCTCTC TTCACCCCCA CTGCAAACCG 

14470 14480 14490 14500 14510 14520 

GGACACAGAG CAGACTTCAG CGCCAGGCCC ATGCCCAGCT CTAGCTGGCA ACAAAGCCAC 

14530 14540 14550 14560 14570 14580 

CACTTTCCTT gcccctctcc gtcctcagtt tttatgatgt cattcttagc ttttcttatc 

14590 1460 0 14610 14620 14630 14640 

AAGAGGCAGA ATCTCTTTTC CCCATCCCAT CAATCTGAAC TGGTCTTGTG GCTTAGTTTG 

14650 14660 14670 14680 14690 14700 

GTCAATAGAA TGTTGTCGGA GGGATGGTTT ACCAGTTTTG AGCTAGGCCT CAGGAGGTCT 

14710 14720 14730 14740 14750 14760 

AGGGCATGTC TACTCTCTCT TAGGACAGCT GCCCCCACCC TGCAAAAAAG CCTGGGCTAG 

14770 14780 14790 14800 14810 14820 

CCTGCTGGAC GATGAGAGCC CACCTGGATC AGTTGTCTCA GCTGATTTCA GACACGTGAG 

14830 14840 14850 14860 14870 14880 

AGAGAGCTCA GCGAGACTCA GCTTGTAGCT GACTACAGAT GTGTGAGGGA ACCTGGCTGA 

14890 14900 14910 14920 14930 14940 

GACCAAAACA ACTCTCCACC TGAGCCCACG CTAAACTGCC AACATGCAGA ATTGTGAGCT 

14950 14960 14970 14980 14990 15000 

AAATAAAGGC TGCTGITCTA ACTCACTGGG TTTTGGTATC GTTTGTTAGG CAGCCATAAC 
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15190 
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15220 


15230 
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15250 


15260 


15270 


15280 
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Fig. 20 Diagram of Vector to Express Dicistronie xnRHA. 
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PLA2 ACTIVITY 




CELL LINES 
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1 50 
Human Type I AVWQFRKMIKCVIPGSDPFLEYNNYGCYCGLGGSGTPVDELDKCCQTHDN 
Human Type II NLVNFHIMIK-LTTGKEAA1£YGFYGCHCGVGGRGSPKDATDRCCVTHDC 
HPLA2-10 GLLDLKSMIE-KVTGKNALTNYGFYGCYCGWGGRGTPKDGTDWCCWAHDH 



51 100 
Human Type I CYDQAKKLDSCKFLLDNPYTHTYSYSCSGSAITCSSKNKECEAFICNCDR 

Human Type II CYKRLEKR-GC GTKFLSYKFSNSGSRITC-AKQDSCRSQLCECDK 

HPLA 2-10 CYGRLEEK-GC NIRTQSYKYRFAWGWTC-EPGPFCHVNLCACDR 

** * * * 

101 133 
Human Type I NAAICFSKAP — YNKAHKNLDTKKYCOS 
Human Type II AAATCFARNKTTYNK-KYQYYSNKHC RGSTPRC 
HPLA2 " ^ 0 KLVYCLKRNLRS YNP-QYQYFPNILCS 



Alignment of amino acid sequences of human type X, II and 
HPIiA2-10 PIA2. Asterisks denote eighteen residues that 
have been conserved among all active PLA^ sequences. 
Underscored residues denote the amino acid COOH-terminal 
extensxons. 



Top line is SEQ ID NO: 38:; 
Middle line is SEQ ID NO:39:; 
Bottom line is SEQ ID NO: 40:, 
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FIG. 24 
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FIG. 27 

1 46 
RPIA2-Type I AVWQFRNMIKCTIPGSDPLREYNNYGCYCGLGGSGTPVDDLDRCCQ 
RPIA2-Type II SLLEFGQMIL-FKTGKRADVSYGFYGCHCGVGGRGSPKDATDWCCV 
RPLA2- 8 SFWQFQRMVK-HITGRSAFFSYYGYGCYCGLGGRGI PVDATDRCCW 

RPLA2 - 1 0 GLLELKSMIE-KVTGKNAVKNYGFYGCYCGWGGHGTPKDGTDWCCR 



47 !• 92 

RPLA2-Type I THDHCYNQAKKLESCKFLIDNPYTNTYSYKCSGNVITCSDKNND — 

RPLA2-Type II THDCCYNRLEKR-GC GTKFVTYKFSYRGGQISCS-TNQDS- 

RPLA2- 8 AHDCCYHKLKEY-GC -QPILKAYQFAIVNGTVTCGCTMGGGC 

RPLA2 - 1 0 MHDRCYGLLEEK-HC AIRTQS YDYRFTQDLVICEHDS F 



93 137 
RPLA2-'Type I -CESFICNCDRQAAICF — SKVPYNKEYKDL-DTKKHC 
RPLA2-Type II "CRKQLCQCDKAAAECFMWKKSYSLKY-QrYP-NKF CKGKTPSC 
RPLA2^ 8 LCGQKACECDKLSVYCFKENIATYEKTFKQLFPTRPQ CGRDKLHC 
RPLA2 - 1 0 -CPVRLCACDRKLVYCLRRNLWSYNRLY-QYYP-NFLC 



Alignment of amino acid sequences of rat Type I» II. RPLA2-8 and 
RPLA2-IO PLA2S. Asterisks denote eighteen residues that have been 
conserved among all active PLAg sequences. Underscored residues 
denote the amino acid COOH-terminal extensions. 



RPLA2''*^yP^ ^ sequence shown corresponds to SEQ ID NO: 41:; RPLA2"^P® sequence 
shown corresponds to SEQ ID NO:42:; RPIA2-8 sequence shown corresponds to SEQ ID 
NO: 43:; RPIiA2-10 sequence shown corresponds to SEQ ID NO: 44:. 
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and methods of III. The protein can be isolated from tissues, or produced synthetically. Furthermore, the proteins of I 
are not required for the compositions and methods of III. 

Group II is distinct from each of III and IV, because the nucleotide sequences of II can be used for several different 
purposes. Besides the methods of III and IV, the sequences of II can be used as hybridization prc^>es for isolaUon of 
related genes. 

Groups III and IV are distinct, each from the other, because the vectors, cells and methods of SI are not necessary for 
the methods of claims 62-68. The method of claim 59 requires reagents and procedures not required by the methods of 
III, and the methods are not obvious variants. Furthermore, the methods of IV are not required for the production or 
use of the compositionsand methods of III. 

Accordingly, the claims arc not so linked by a special technical feature within the meaning of PCT Rule 13.2 so as to 
form a single inventive concept. 
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